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integral equations in the sciences and elsewhere, the evaluation of 
integrals is usually vital. Next, continuous linear functionals and 
self-adjoint (hypermaximal) linear operators over spaces of functions 
are represented in spectral theory and elsewhere by Stieltjes-type 
integrals. Finally, integrals are used for distributions in statistics, 
and these are in turn often used to approximate certain sums. For 
example, binomial sums are approximated by using suitable normal (or 
Gaussian) integrals. 

Students usually begin with the calculus inverse of the derivative, 
the indefinite integral of Newton, and then turn to the definite integral 
of Leibnitz, Cauchy, Riemann and Darboux, to name the principal 
mathematicians involved. These simple integrals satisfy many; much is 
done in physics and applied mathematics by using the calculus alone, 
without recourse to more modern ideas. However, it is becoming 
increasingly clear to scientists in general, especially those interested 
in statistics, statistical physics, or quantum theory, that such a 
limitation of the pure mathematical tools is a restriction, and that the 
use of a more powerful integration process will result in a greater ease 
of application. 

Lesbesgue integration began in a famous paper of 1902, has several 
useful and interesting properties, and is more powerful in some directions 
than calculus integration. But a student of its theory endures a complete 
change of ideas. One approach needs a theory of measure to construct 
the integral. In another approach the integral is defined directly as a 
continuous linear functional. But, whatever the approach, the theory is 
formidable and many find it difficult to understand. Yet the evaluation 


of almost all Lebesgue integrals uses calculus methods. 

This curious paradoxical situation often irritated the author when 
researching in Lebesgue and allied integration theory from 1943 to 1958. 
Further, even in one dimension it is difficult to estimate a basic element 
of Lebesgue theory, the measureof the union of open neighbourhoods of known 
lengths of members of an everywhere dense sequence of points, even when 
the lengths and points have easy formulae. There is much overlapping of 
neighbourhoods. 

A useful tool for mathematicians and scientists needing advanced 
integration theory would be a method combining the ideas of the calculus 
indefinite integral and the Riemann definite integral in such a way that 
Lebesgue properties can be proved easily, so that the student would not 
have such a radically fresh start as is needed for Lebesgue integration. 
Before the 1950's it would have seemed to be an impossible dream. But 
independently J. Kurzweil (1957) and the author (1955, 1961) arrived at 
the same construction, a simple extension of the construction of the 
Riemann integral. The resulting integral includes the Lebesgue and Denjoy- 
Perron integrals and is the subject of this book. It is one of a whole 
family of integrals definable by using the Riemann sums over divisions 
with a limit process that is stronger than the limit as the mesh tends to 
0. The theory has advanced considerably in the 25 years since writing 
the first book, which is now out of print. Comparison of that pioneering 
book with the present book shows how proofs have simplified considerably. 
Further, conditions have been found that are both necessary and sufficient 
for a sequence of integrable functions to have an integrable pointwise 
limit, and for the limit to be taken inside the integral sign. The 
discrete variable can be replaced by a continuous variable, so that 
necessary and sufficient conditions are found for differentiation and 
integration under the integral sign. Thus Sections 11, part of 13, 14, 15 
in part, and 18 are new, while in Section 19 the Carathéodory theory of 
ordinary differential equations is improved. 

I wish to express my appreciation to Ms. Terri Moss for the excellent 
typescript, and to many colleagues and students who have checked the lecture 
notes and have helped to produce a better theory. 


R. Henstock 
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0 Prerequisites 


We begin with the usual notation of set theory. A collection of 
objects is sometimes called a set and sometimes a family, these two terms 
being interchangeable. If an object x is a member of a set X we write 
x €X. Sing (x) is the set whose only member is x . If every member of 
X is also a member of a set Y we write X CY and Y>X. If X and Y have 
the same members, X © Y and Y € X, we regard X and Y as being the same set 
and write X = Y. Otherwise we write X # Y and say that X,Y are distinct. 
If X CY, X#Y, we write X ©Y, YDX. If Misa family of sets X we 


write, 


for the union and intersection, respectively, of sets X of M. The first 
is the set of all objects each of which lies in at least one of the sets 
X of M. The second is the set of all objects each of which lies in all 
sets of M. Usually M is omitted from the symbols and there are other 
slight variations. For instance, if M is the collection of Ke for 


j = 1,2,..., we write 


gt T 


Note that these definitions do not need any idea of a limit. The union 

and intersection of two sets X42Xo are for simplicity written as x, U Xos 

X, n Xos respectively. Two sets with no common member, so that the inter- 
section is the empty set, Ø, are called disjoint. If each pair of sets 
from a family M of sets is disjoint, we say that the sets of M are mutually 
disjoint. If X is a set of real numbers the complement >X of X is the set 
of all real numbers that do not lie in X. Thus X n XX is empty and X u NX 
is the set of all real numbers. More generally, XXY denotes the set of all 
objects in X that are not in Y. If (x;) is a sequence of sets then we 


define 
lim sup X; = N u X., lim inf X, = U A Xs 
joo J m j=m J j>% m=1 j=m 
ix 


and if they are the same set we write it as 


lim Xs 


J>% 
There is still no idea of a limit in these definitions, even though lim 


occurs in the names. If X, € X See SX 


1 zG aire ChE: T0 Xan Ue Xn 
s p - 


J j j=1 J 
We say that (x) is monotone increasing. On the other hand, if 

X, > Xo Dee D X; >... then Sa Xj = 2 Xj. We say that (x;) is 
monotone decreasing. A relation T between the objects x of a set X and 
the objects y of a set Y, is called a one-one correspondence if T 
satisfies the following requirements. 


(0.1) If x €X there is a y €Y such that xTy3 
(0.2) if there is az € Y such that xTz, then z = ya 
(0.3) if y € Y, there is an x € X with xTy; 


(0.4) if there is a w € X with wTy then w = x. 


We also say that X,Y are similar. Using this idea we could define the 
positive integers as cardinal numbers, and then proceed to define the real 
and complex number systems. Instead we assume that these systems are 
already given and will only use the idea of similarity to define count- 
ability. A set X is countable if it is similar to the set of all positive 
integers. Then a subset of a countable set is finite or countable. Also 

a finite or countable union of finite or countable sets is finite or 
countable, which is proved by Cantor's diagonal process. But the set of 
all real numbers in 0 <x < 1 is not countable. (For each x in the set 
can be written as a decimal in the scale of 10, say, x = OX Xoe eXe 
where Xh takes an integer value from 0 to 9. Some real numbers have a dual 
representation, for example, 0.50000... = 0.4999... . To remove this 
duality we make the convention that no decimal has x = 9 for all n greater 
than some integer. If (x;) is a sequence of decimals, with 

Xj = DX 5 pXaoX ager Xj we can define a decimal y = 0. YYY 300 Vers by 
the rule 


= = = 8,9), 
Yn = Xan * 1 (0x Xam § 7), Yh | Cini ) 
so that Ya #9. Then y differs from each Xj in at least the jth. decimal 
place and 0 < y < 1. Thus no sequence can contain all x in 0 < x < 1.) 
The rationals (ratios of integers) are countable, for they are a 


distinct subsequence of the sequence 
0,1/15-1/152/1,-2/1,1/2,-1/2,3/1,-3/1,2/2,-2/2,1/3,-1/3,4/1,... 


Or put j/k in sequence by writing j and k as integers in the scale of 10, 
separated by the /, regard / as the integer 10 in the scale of 11, 
interpret j and k now as integers in the scale of 11, and we have an 
integer in the scale of 11. Such an integer cannot be given by any 
other rational. Thus countability follows, and then the uncountability 
of the irrationals (those real numbers that are not rational). 

In a small part of the book we need a little elementary topology. 
An open sphere in n-dimensional Euclidean space R? with centre 
x = (Xj 5+++9X)) and radius r > 0, is the set of all y = (Ygs teyp) 
satisfying 


A set G in R” is open if each point of G is the centre of a sphere 

wholly contained in G. A union and a finite intersection of open sets 
are open. A point x is the limit point of a set X in R" if every sphere, 
centre x, contains a point of X other than x. An isolated point of X 

is a point of X that is not a limit point. The set X' of all limit points 
of X is called the derived set of X, and X u X' is ye the closure of X. 
A set F that contains all its limit points, is called closed. The letter 
F is taken from fermé, French for 'closed'. A set X is dense-in-itself 
if every point of X is a limit point of X (X =X'). A set P that is 
closed and dense-in-itself (P = P') is called perfect, and hence the P. 
The complement of an open set is closed, the complement of a closed set 
is open. An intersection and a finite union of closed sets are closed. 


xi 


A set M is dense in a set X if M SX CM". On the real line all points 
x of X for which, for some e > 0, there is an interval (x-e,x) or an 
interval (x,x+e), or both, free from points of X (i.e. the set Y of 


points of X isolated on one or both sides) is a finite or countable set. 
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CHAPTER 1 - INTRODUCTION 


1. The Riemann And Riemann-Darboux Integrals 


The Greeks used geometry to compute many simple areas, giving rise 
to the method of exhaustions of Eudoxus (c. 408-355 BC) and Archimedes 
(c. 287-212 BC). In this first crude limit process a sequence of non- 
overlapping triangles is fitted into the area to exhaust it. Thus they 
found the areas of the circle and sections of the parabola. But they 
did not define a general non-negative polynomial over a given range, nor 
even graphs, and so could not compute the area under a curve given by a 
non-negative polynomial. 

Graphs were introduced in the 17th. century, possibly by R. Descartes 
(1596-1650) in 1619, Christiaan Huygens mentioned them in a letter to his 
brother Lodewijk on 21st November 1669. The calculus was also invented 
and used by I. Newton (1642-1727) and G.W. Leibnitz (1646-1716) at that 
time. 

However, setting the scene for modern integration, we begin with the 
definite integral on the real line of G.F.B. Riemann (1826-1866) in 1854 
(published 1868) as modified by J.G. Darboux (1842-1917) in 1875. For 
real numbers a < b and the closed interval a <s x < b or [a,b], let 
f : [a,b] >R. We partition [a,b] into smaller intervals by points 


<u_=b. 


Chast) a = ug < Uy < Ug <... < Un- i 


Its mesh is the maximum of uj -Uj | for j = 1,2,...,m. When f 20 in [a,b], 
the smallest rectangle with base Lu; quj] that encloses the part of the 
graph of y = f(x) with x in the base, has height the least upper bound or 


supremum of f there, 


M. = sup {f(x) : 


x < < 7 
j Mad APS NG 


The collection of m rectangles, for j = 1,2,...,m, has area 


m 
Us g M; (uju ), 


gal 


the upper Darboux sum for the partition. Similarly the rectangle with 
base [us yous] that lies just below the graph, has height the greatest 


lower bound or infimum 
mj = inf {f(x) : Us; 4 $X Sus}. 
The area of this collection of rectangles, for j = 1,2,...,m, is 


1; 


Eom 
L= Z muus 
jai Í i jt 
the Lower Darboux sum for the partition. If f is sometimes negative in 
[a,b] the m., Mas L,U can still be defined in this way, but of course 
the geometric picture changes slightly. Clearly U > L. If these sums 
tend to the same limit I for arbitrary partitions (1.1) of [a,b] as the 
mesh tends to 0, we say that the Riemann-Darboux integral of f exists 
over [a,b] with value I. Then U and L have to be finite for some 


partition, and 


(1.2) tf f ts Riemann-Darboux integrable over [a,b] then f ts 
bounded there. 


Riemann's 1854 definition (published 1868) is different. For f and 
(1.1) as before, let Xj be an arbitrary point of Cu; _, U5] C pee.) 
with 


m 
(1.3) s= 3 f(x.) (u.-u. 1)» = (E) zf(x)(v-u). 


The division E is the collection of (Lu; _ 445) x5)(3 = 1,...,m) used in 
(1.3), its mesh (E) is the mesh of (1.1), and (E) = denotes summation 
over E. If every such s tends to a fixed number I as mesh (E) > 0, we 
say that the Riemann integral of f exists over [a,b] with value I, and 
we write I = (R) fdx. More precisely: 

[a,b] 
(1.4) given e >0,a6>0 depending on e is such that tf mesh (E) < & 


then |S-I| < e. 


If L and U tend to the same number I as mesh (E) > 0, so does s 
since L <s < U. Conversely, in (1.3) with a fixed partition (1.1), for 


e > 0, there are Xqoeee oXpp with 


Ms 2 f(x.) > Mj-elj = 1,...m), U 2 S > U-e(b-a), 
b-a being finite. For another choice of Xyaeee sXe 


m 


s sL + ¢(b-a). 


3 
l, 
=$ 
~ 
x< 
rae 
A 
3 
+ 
m 
~ 
a 
i 
as 
Ş 
my 
= 
= 
v 
= 
A 


As e > 0 is arbitrary, if s + I as mesh (E) + 0, then L > I, U > I. Thus 
by (1.2), 


(1.5) tf f is real valued, the Riemann and Riemann-Darboux 
definitions yteld the same results, and an integrable 


funetton ts bounded. 


A.L. Cauchy (1789-1857) gave a similar definition in 1821 in which 
Me = Ue Gh sesasi 
Riemann (1868) art. 5, shows that 


(1.6) the necessary and sufficient condition for the Riemann 
integral of a bounded function to exist over [a,b], ts 
that the total length of the subintervals for which the 
oscillation is greater than any fixed positive number, 


ts arbitrarily small. 


Those already knowing Lebesgue theory can note that here appears the idea 
of a Riemann-integrable function being bounded, and continuous almost 
everywhere, though not expressed in the Lebesgue notation. 

We can take ey SU OF Ke = Be ga A choice for each j in faaea: 
For if uj} < Xj < Uj We can replace [us 4 sus] by Cu5_4 XJ and [x;y] 
while the sum is not altered since 


F(x; )(uj-us_4) = F(x 5) (xju; 4) + f(x) u 5 xj). 


Ex. 1.1. Evaluate 


tim (1/(5241) + 2/(5° 44) + 02. + jG) 


jroo 
(University of Ulster 1986, M 112). 
Hint: write the expression inside the main brackets as a sum of 
ogy + (rg DTL, for r = 1,...,j, a Riemann sum to integrate 
xla y | from 0 to 1. 


Ex. 1.2. Similarly find 


J SRE 
Imo gaa a, 


J>% k=1 
(New University of Ulster 1975, M 111). 


Ex. 1.3. IfO0<rx<1and if f(0) = 0 and f(x) = rr <x < Ay 
m= 1,2,..., prove that f is Riemann integrable over 0 < x < 1 with value 


r/(1+r), even though f has an infinity of discontinuities. 


(New University of Ulster 1972, M 112) 


Ex. 1.4. Let (s;) be a not necessarily monotone sequence of points in 
[0,1] with infimum s, and put 


f(x) = (sy <x) £83.57, F(x) = 0 (x 5). 


Prove that the Riemann integral of f exists over [0,1] with value 


S.(1-s.)j 
z jí ps 


(New University of Ulster 1972, M 213) 


2. Modifications Using The Mesh And The Refinement Of Partitions 


The Riemann-Darboux definition depends on there being a reasonable 
order in the space of values of f, and can be extended to functions with 
values in lattices. But it can only be extended to complex-valued 
functions by integrating the real and imaginary parts separately, which 


causes complications. The Riemann definition, however, is far more 
versatile, it can be used for functions with values in lattices, the 
complex plane, the set of quaternions, and even a topological group. 
Again, if for real-valued or complex-valued functions we replace sums 
by products, we have a Riemann product integral, see J.D. Dollard and 
C.N. Friedman (1979) which gives many references. 

T.J. Stieltjes (1856-1894) in 1894 modified (1.3), using another 
function g and replacing f(x)(v-u) by f(x){g(v)-g(u)}, and replacing s 
by 


m 
(2.1) S =} F(x; ){g(uj)-glus_4)} = (E) z F(x) {g(v)-g(u)}. 
= 


The resulting integral as the mesh tends to 0, is called the Riemann- 


Stieltjes integral and is written (RS) f dg. If g(v)-g(u) always 
[a,b] 
remains of the same sign we can define a Riemann-Darboux-Sttieltjes 


integral when necessary. 

J.C. Burkill (1924) replaced f(x)(v-u) by a function h(u,v) of the 
interval from u to v, or an interval function, obtaining the Burkill 
integral. 

The author uses an interval-point function h({u,v),x), and in 
Henstock (1963), p. 26, one finds {hy shy} where h (uv) = h({u,v),v), 
h(usv) = h({u,v),u). 

If a real- or complex-valued function g, defined on [a,b], is such 
that for some positive number M, 


m 
wie 4 <M 

. Ig(u;) - glu;_4)| < 

for every partition (1.1) of [a,b], we say that g is of bounded variation 
on [a,b]. The smallest value of M is the variation var(g;[a,b]) of g. 
Then g is bounded, since 


Ig(x)| < |g(a)| + |g(x) - gla)| + |g(b) - g(x)] < [glad] +M. 


Theorem 2.1: If f ts continuous and g of bounded variation on [a,b], then 


f is Rremann-Stieltjes integrable relative to g on [a,b]. 


Proof: 
that 


(2.2) 


Let E be the division in (1.3) with mesh (E) the mesh of (1.1), and let 


By uniformity of continuity, given e > 0, there is a 6 > 0, such 


lf(v) - f(u)| <e (asu<v 


Sb, v-u < 26). 


Ei be the special division consisting of (Cv, qY) v) (3 = “Lemon bd 


obtained by using yat (b-a)j/p (j = 


intersections of every overlapping pair of intervals from (1.1) and Ep 


0,1,...,p). We consider the 


and use (2.2), taking mesh (E) < 6, b-a < pô. If 


[f(x5){g(t)-g(s)}-F(vy fg(t)-g(s)}]= [#(x;)-F(y,)[19(t)-9(s) |selglt)-g(s)], 


Cuj 044 n [vk-1>Yk] = [s,t], 


I(E)Z F(x) {g(v)-g(u)} - (E Ef (x) {g(v)- 


Taking E = Eg with q§ > b-a we see that 


((E,) z (x) {g(v)-g(u) }) 


then Ix; - Vv, | < 26, 


glu) se 2 |g(t)-g(s)| se M. 


is a fundamental sequence and so tends to a limit I, and then for 


arbitrary E with mesh less than §, and p > œ, the result follows from 


|(E) z f(x){g(v)-g(u)} - I] < eM. 


Theorem 2.2: As mesh (E) > 0 let 


If f is Riemann-Stieltjes integrable relative to g, then g ts Riemann- 


(E) z | (F(v)-F(u))(g(v)-g(u))| + 0. 


Stieltjes integrable relative.to f and 


(2.3) 


This is the formula for integrating Riemann-Stieltjes integrals by parts. 


Proof: 


(RS) | fag + (RS) Í 
[a,b] 4[a,b] 


The following is an identity. 


gdf = f(b)g(b) - f(a)g(a). 


(2.4) g(x) {f(v)-F(u)} + F(x) {g(v)-g(u)} = F(v)g(v)-Flu)g(u) + 
+ {f(x)-f(u)}{g(x)-g(u)} - ¢#(v)-F(x) }{g(v)-g(x)}. 


In (2.4), all sums over divisions of the first two terms on the right are 
equal to 


f(b)g(b) - f(a)g(a), 


sums of the last two products tend to 0 as mesh (E) + 0, and the sum over 
E of the second term on the left tends to the Riemann-Stieltjes integral 
of f over [a,b] relative to g. Hence g is Riemann-Stieltjes integrable 
over [a,b] relative to f, with (2.3). 


Theorem 2.3: If g is of bounded variation and f continuous on [a,b], 
then g ts Riemann-Stieltjes integrable relative to f on [a,b] with (2.3). 


Proof: From (2.2) and an arbitrary division E of [a,b] with mesh < § , 


(E) z |f(v)-F(u)|-|g(v)-g(u)| < (E) zelg(v)-g(u)| s eM 


and so tends to 0 as mesh (E) + 0. Hence the result follows by Theorems 
Paila Aila 

Note that if f is continuous but not of bounded variation, with g of 
bounded variation, one cannot define a Radon or Lebesgue-Stieltjes 
integral of g relative to f, even though the Riemann-Stieltjes integral 
exists. 


Theorem 2.4: f is Riemann-Stieltjes integrable relative to itself and 
BE 2 
(2.5) (RS) fdf = 2f°(b) - f° (a), 
[a,b] 


if as mesh (E) + 0, 


2 


(2.6) (E) z]f(v)-f(u)|? > 0. 


The converse is true when f is real-valued. 


Proof: In (2.4) with g = f, the two terms on the left are equal and the 
sum of F°(v)-#2(u) over E is #°(b)-f2(a). Hence (2.5). For the converse 
take x = v and g = f in (2.4). The next theorem shows that with continuity, 
the boundedness of the variation can in some sense be split up between f 


and g. 


Theorem 2.5: Let f,g be continuous in [a,b] with respective moduli of 
continuity Alh), ulh), (i.e. ath) = sup {|f(y)-F(x) |: [y-x | sh, (x,y) = 
(a,b)}, and similarly for y(h).) If 


M(h) = | Mt)ult) at 
[0,h] 2 
converges for small h > 0, then (RS) | fdg exists. 
[a,b] 
Proof: Writing E* for the division E' op of Theorem 2.1, we prove the 


existence of 


lim Ho» = H, say, where H, = (E5) 5 (x) {g(v)-g(u)}. 


Poo 


For h = v-u > 0 let [u,v) have midpoint y. Then 
\f(v){g(v)-g(u) }-F(y) {g(y)-g(u) }-F(v){g(v)-g(y)}] = 


if(v)-F(y) | |g(y)-g(u)| <  Gh)u(3h). 


Repeating the bisection till [u,v) has a partition P of 2P equal intervals, 


(2.7) |f(v){g(v)-g(u)}-(P) z F(t) {g(t)-g(s)}| < 


p f 4 = 
z 2971 a(h2™ ulh) < 2(v-u)M(v-u), 
j=! 


has, 


since for x > 0 and then x 


IV 


(2.8) f Ault) gp > Abedu(x).x 
Xx t (2x)? 


Hence for q >p, 
IH Hq] s 2(b-a)M((b-a)2P) + 0 


as p > œ, and (Hp) is a Cauchy or fundamental sequence and so has a limit, 
say H, with 


(2.9) |B H] < 2(b-a)M((b-a)2"P). 


For P a partition of m equal intervals with A <m < 2P we extend [u,v) 
to [u,w) by adding 2P-m intervals each of length h/m, with f,g constant 
on [v,w]. Then the new length is 


h + (2P-m)h/m = 2Ph/m < 2h, and f(v) {g(v)-g(u) } = F(w) {g(w)-g(u) }. 
For some § > 0 with 4M(2g) < s, and v-u < §, (2.7) becomes 
(2.10) |f(v) {g(v)-g(u)}- (P) 2F(t){g(t)-g(s)}| <4(v-u)M(2(v-u)) <e(v-u). 


Next, let TyoeeesT, be a partition P* of [a,b) of mesh < 6/3, and by 
continuity of f and g, let n > 0 be such that 


|f(v) {g(v)-g(u)}-F(t) {g(t)-g(s)} <e(b-a)/r (|v-t] <n, fu-s| < n). 


For (b-a)2P < n leta = Ug < Uy <... <UL = b be the partition from 
* : 3 

Ep With each I} associate the interval [u sU) = Jy whose end-points 
are the u's that are nearest to the left and right end-points of 


1j EE A E Sa ty Jj disappears. Then 


r 
|(P*) z f(v){g(v)-g(u)}- : Flu, ){g(u,)-g(u,)}| < elb-a). 
As uu, <$ 3mesh (P*) < §, dividing up those Lu, .u,) with t > s+1, (2.10) 
and (2.9) give 
r 
| 7 F(u,){g(u,)-g(u.)} - Hp! < e(b-a), [HH] < e(b-a), 


|(P*) z f(v){g(v)-g(u)}-H| < 3e(b-a), 
(P*) z f(v){g(v)-g(u)} > H (mesh (P*) + 0). 


Finally we change v to x inu sx sv. From (2.8), if 0 <x < 6, 
4M(2x) < e, Alx)u(x) < ex; 


so that if (15 5x5)(3 = 1,...,r) forma division E of [a,b) of mesh < §/3, 


ICE) = F(x) {g(v)-g(u)}-(P*) 5 F(v) {g(v)-g(u)}] < (P*)zalv-u)u(v-u) 
< e (P*)2 (v-u) = e(b-a). 


Corollary: The proof of the theorem shows that, given e > 0, there is a 

ô > 0, depending on e and the functions i, alone, and independent of the 
particular f,g with respective moduli of continuity not greater than ).y, 
such that 


|(E) z F(x) {g(v)-g(u) }-(RS) | fdg| < e (mesh (E) < 6). 
[a,b] 
Theorem 2.6: If g:[a,b] > R or C, ts of bounded variation in [a,b], g 
has an at most countable number of discontinuities (a,) in [a,b], each 
one of which can be approached from above (except tf aj = b) and below 


(except tf aj = a) by points where g is continuous. Further, there exist 


(2113 g(x-) = lim g(u) (a <x < b), g(x+) = lim g(u) (ag x <b), 
u>x- U>X+ 
(2.12) Fe {|9(a;)-g(a;-)| + |gla;+)-g(a;) |} <M = var(g:[a,b]). 


(2.13) g, (x) is continuous in [a,b], where 


g, (a) = g(a), g(x) = g(x) = z4{g(aj)-glaj-)}-z1g(a;+)-g(a;)}, 


and zy ts over all aj ina xz aj S Ks Zo is over all aj with as a; & X: 


(2.14) If f is continuous, then, except for obvious changes when aj =a 


or b, 


fdg + Z f(a;Hglaj+)-gla;-)}. 


(RS) | 
[a,b] j=l 


fdg = (RS) | 
JEa,b] 
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Proof: For [u,v) the set of all x inu <x <v let (Cu;.v)) be a sequence 


of mutually disjoint intervals in [a,b). Then, for each m, 
m 
[a,b] ~ U 


fUs Vis) 
j=) 4 


J 


is a finite number of intervals, so that by their partitioning, 


(2.15) 


WA 
= 
~ 
5 
I 
a 
v 
N 
v 
~ 


m 
p gv) g gus) | 
and the infinite series is convergent. First take uj =a, Uj = Vj 
(J = 258200))'s vj >c <b. Then 


G = lim g(v;) = g(a) + z {g(v.) = g(v._,)} 


j>» ja d Jol 
exists as an absolutely convergent series. For (ws) also monotone 
increasing to c, g(w;) similarly tends to a limit, say G*. Rearranging 
(vj) and (w.) as a single monotone increasing sequence we find G* = G 
and the limit of g(v,) is independent of the particular sequence (vs), 
and so g(c-) exists. Similarly for g(c+) ina sc <b. Then to prove 
(2.12) we take m points a. in [a,b] and 2m mutually disjoint intervals 
(u.,a.), [a.,v.) in (2.15) (omitting an interval if a = aor b), and so 
obtain (2.12) with m replacing œ. Thus at most mM points have 
|g(a;)-9(a;~) | 2 1/m, and at most mM points have |gla;+)-g(a;)| > 1/m, 
and we can put such points for m = 1,2,... in sequence as (a;). Hence 
the discontinuities are at most countable, and we have (2.12). The series 
in (2.13) are thus absolutely convergent and that result follows. For 
(2.14) we use Theorem 2.1 and only need prove that if f is continuous in 


[a,b], (a;) a sequence of points in (a,b), and a cj is absolutely 
convergent then 

(2.16) z c;la) = (RS) leas fdh, h(x) = Efe; a < ays x), 
and similarly for a < a. < x replacing a <a, < x in the definition of h. 


To prove (2.16), let the division E consist of (Lup 4 Uj) Xp) (k=1,25... ar) 
with mesh (E) < 6, where 
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If(x) - fly)| < ela sx <y sb, |x-y] <6). 


The result follows from 


rd 
) - 3 F(x) {h(u,) - h(uy_4)}| 


f co 
Fs ric [flas)-Flx J]: wy < aj sus e & Ie, | 

Finally the points of continuity are dense in [a,b] since the discontin- 
uities are countable and the points in each interval are not countable, 
so a point of continuity lies in every interval. 

For c > 0 let » : [0,c] + R be monotone increasing. Note that +o is 
not in R, so that we are implying that ọ is finite. Then g is of bounded 
variation as every difference is non-negative, and the moduli of 
differences add up to (c) - 9(0). For convenience we assume that 


g(x) = o(x-) (0 <x sc) 
and we define y: [9(0), o(c)] + [0,c] by 
wly) = inf {x: 0 <x sc, (x) 2 y}. 
We say that y is the inverse function of p, in Young's sense. 


Theorem 2.7: For $,y as given above, yis finite and monotone increasing 


and 


(2.17) (y) = wly-) (9(0) < y < o(c)), 


(2.18)  y(ọ(x)) x} (Os xc). 


inf {z : (z) = g(x), 0 <z 


HA 


(2.19) If also » is continuous at w(y) and (0) < y < o(c) then 
o(ply)) = y. 


Proof: If $(0) <v <y < olc) then y(v) (y) by the monotonicity of 9. 


v) sv 
If for fixed k and some y in 9(0) < y < o(c), wly-) < k < wly), then for 
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every v < y we have y(v) <k, v < o(k) <y. But this is false for 
v = B{p(k) + y} < y. Hence (2.17). For (2.18), wlg(x)) = inf {z: 
o(z) = (x), 0 < z sc} while one value of z is x. Hence the result. 


For (2.19), from x > ly), (x) 2 y and o(wly) +) 2y. Fromx < ly), 
o(x) < y and o(yly)-) <y. By continuity of g we have (2.19). 


Theorem 2.8: For ġ, was before, and O sa <b < c, then 
(2.20) D f xR | ydy = by - ag(a) (9(b) sy sọ(b+)). 
[a,b] Cola) sy] 


Proof: As , y are both monotone increasing and finite, they are of 
bounded variation, and by Theorem 2.3 the two integrals in (2.20) exist. 
It follows that we can use any two suitable sequences of divisions with 
mesh tending to 0, to prove (2.20). Let 


a = ug <U < see SUD = b 


be a partition of [a,b] with mesh less than e. For each u, that lies in 
an interval (v,w] c [a,b] in which (w) = g(v) + but (x) < o(v) for all 
x < V, we replace uj by v. The number of intervals may then drop as 
several uj may correspond to the same v. For the sake of the notation we 
then change the other uj to Vk for some k < j. This change may increase 
the mesh to beyond ge, but the sum over [v.u;], for the u, that are moved, 
is trivially olw)(us-v) so that the increase in mesh can be brought to 
be less than e by introducing extra partition points on the interval. 

We now consider the partition 


(2.21) o(a) = lvo?) < o(v,) <... <olv.) = by. 
By Theorem 2.7 (2.18), 
wlo(v,)) = Yk (k = \eeeusP=1)is 


while from the definition of y, p(y) = b. Thus we obtain 
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p p-1 
t CV YM) + = vlolvy)) olv) 4) uly) (y-o(yy 4) 
p p-1 
= 5 


after much cancellation. The mesh of (2.21) need not tend to 0, because 
of the discontinuities of ». But by Theorem 2.6 (2.13), ọ = ¢ e*s 
where ¢. is continuous and g, is a jump function (fonetion des sauts), a 
sum of o(s+)-g(s) at the jumps s of p. The differences of , can be made 
arbitrarily small by taking the partition of [a,b] of arbitrarily smal] 
mesh, while the effect of each jump g(s+)-@(s) is to introduce an interval 
on which y is constant at the value s. Extra division points can be taken 
on a finite number of such intervals without affecting the sum over 

Cola), y], so that the final division is of arbitrarily small mesh. Hence 
(2.20) follows. 


Theorem 2.9: In Theorem 2.8, if 0s asc, 0= (0) sb < o(c), then 
(2.22) ola) + ¥(b) z ab where ola) = (R) | g(x)dx (a > 0), 0(0) =0, 
[0,a] 


¥(b) = (R) wly)dy (b > 0), ¥(0) = 0. 


n| 

[0,b] 
Equality occurs in (2.22) if, and only if, 
(2.23) (a) < b g ofat). 


Proof: Since $(0) = 0, y is defined in [0,(c)], and the results are 
obvious if a = 0 or b = 0, using Theorem 2.8 (2.20) for the case of 
equality. Ifc 2 x > o(a+), then by definition of y, y(x) > a. Hence, 
using (2.20) repeatedly, if a > 0, b > 0, and b > (at), 
o(a)+¥(b) = o(a)+¥(o(at)) + (R) l ydy > aġ(a+)+a(b-ọ(a+)) =ab. 
JEola+),b] 


If 0< b< g(a), there is au in0 <u <a with 


lu) sb solut), (x) >b (u<xga), o(a)ey(b) = o(u)+v(b)+(R) | b dx 
' [u,a] 
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(Vp (YM gd + RB? vptol Yp) -ol Vp 4 b(y-o(v,_4))=by-agla) 


Hence if equality in (2.22) then (2.23) is true, while if (2.23) is true, 
(2.20) gives equality in (2.22). 


Theorem 2.10: Let X be a set of real numbers and, for each u z 0, let 
g(u,x):X + R be monotone increasing in X with modulus bounded by M 
independent of n. Then there are a sequence uj > o and a monotone 


increasing function g in X, such that 


lim g(u, 5x) = g(x) (x eX). 


Proof: We begin with a lemma. 


Lemma 2.11: Let g(u) be a function of positive real numbers with values 
in [-M,M], for some M > 0. Then for t = lim sup g(u) there is a sequence 


U-+co 


(u,) tending to infinity with g(u;) ti 


Proof of Lemma: The set S(v), of values of g(u) for all u > v, has a 
supremum, say s(v), and if w > v, 


S(w) c S(v), -M < s(w) < s(v) <M. Zet t = inf s(v) (alt v). 
Then t exists in [-M,M] and is the upper limit of g(u) as u >œ. Let 
uy < Uy Ko wea A ujq u, 2 k (k E a 


be defined inductively. Then as s is monotone decreasing there are a 


vj 2 max (uj q9) with t < s(v;) <t+ i, and as s(v;) is a supremum, 
a u. è V, with 
J J 


t-j! < s(vj)-5"! < g(u;) s slv;) < t+ a 


Thus we continue the induction and so prove the lemma. 


Proof of Theorem: Let (x;) be a sequence of points dense in X, such that 


the sequence includes all isolated points of X and all points of X that 
are isolated on one side. Such points can be put in sequence. Then all 
other points of X are approached on both sides by members of (x.). 
Applying the lemma to g(ux,)> there is a sequence (ujq) tending to 
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infinity, such that g(Us4>X4) tends to a limit, say g(x,). Applying the |h; (ux) h; (u,a) | < 2M (J = 1,2). 


lemma to g(U 54 sXp)> there is a subsequence (uso) of (ujq) with G(Uj2 Xp) 

tending to a limit, say g(x), while also G(U sy 9X4) tends to g(x,). This 

is the start of an inductive proof and we have vi glu.) existing, for 
>0o . 

each k, with value g(x) ) > say. Then the diagonal sequence (u.;) is a 


Applying Theorem 2.10, the result is true. for h,(u.x)-h, (usa) and a 
sequence (v.), and then the result is true for ho(usx)-h, (usa) and a 


subsequence (w5) of (v5). As 


subsequence of (uy? for j 2 k, so that for each fixed k, glu; 5 »Xy) +g(x)) g(u,x)-g(u,a) = a(hy-h,), 
as j > %. , : 
As g(u,x) is monotone increasing in x for each fixed u 2 0, glxy) is the result is true for g(u,x)-g(u,a) in x > a. For a further subsequence 


also monotone increasing and lies in [-M,M]. For a point y € X that is it is true in x < a and so true for all x. Then the Corollary is proved 


not an x,, write g(y-) for the limit of g(x, ) as xX) tends to y from below. since 
Then as g(u,x) is monotone increasing in x, glu; sy) 2 Glu 5 Xg)» 


JJ g(u,x)-g(u,y) = g(u,x)-g(u,a)-g(u,y)+g(u,a). 
lim inf glu; 5sy) 2 G(X, )> lim inf g(u, 5s) 2 g(y-). Writing 


j> 00 
i for the limit of sti) 2s Xk tends to y from above, we have Note wie g(u,a) could take arbitrary finite values without Stele 
similarly, lim sup g(u..sy) < (y+). the condition on the variation, and this is why we have to use differences. 
Thus when g(y-) = g(y+), glu; 5 .y) tends to the common value. As ee limit pitonas met uses the mesh tending to 0, causes difficulties 
g(x, ) is monotone increasing, the points y where g(y-) # g(y+) are at for Riemann-Stieltjes integration when the two functions have common 
most countable and so can be put in a sequence (yg). We can now go discontinuities. E.H. Moore (1915) changed this limit process, supposing 
through the same process again for that sequence and prove the theorem that for some number I and each ¢ > 0, there is a special partition 
with a subsequence of (ujj). 
(2.24) a= Vy < Vy < eee < W“ b 
Corollary: If in Theorem 2.10, g(u,x), as a function of x € X, is of 
variation bounded by M independent of u, there are a sequence (u,) with the property that if the partition 
tending to infinity and a function g of x € X, of variation bounded by M 
with Chat) a = Uy <u, < Up <.. <UL < Um 7 P 


we {gu 5x) glu Ne BKK) = Bly. Way is a subpartition of (2.24) (i.e. every Vv, is au, for some j) with 
ujq $ Xj Su, (j = 1,...,m) then the sum s of (2.1) satisfies 


: ¢x<y. Th s [x,y] is equal to [a,y]Na,x), 
Da be Be a ee Se Bias Ee i ý |s_-I| < e. Such a partition (1.1) is called a refinement of the 
partition (2.24), and the integral so defined is called a refinement 

= »)3La,y])-var(g(u,-)3fa.xJ). k 
|glu,y)-glu,x)| < var(g(u,+);[a,y]) gestalt integral or o-integral. This can be generalized to partitions of 


It follows that h,,h, are monotone increasing on the right of a, where arbitrary sets, and the present one-dimensional case is a little stronger 
i than the Riemann-Stieltjes case using the mesh tending to 0. For if the 
hy(u,x) = var(g(u,.);[a,x]) +g(u,x), h,(u,x) =var(g(u,-);[a,x])-g(u,x), mesh of (2.24) is less than some 6 > 0, so is the mesh of any refinement. 

Thus 


and as the variation is not greater than M, 


= 17 


(2.25) if the Riemann (or Riemann-Stieltjes or Burkill) integral exists, 


so does the corresponding refinement integral. 


The improvement made by the refinement integral is that for certain 


i i < b are ignored 
points c ina<c< b, sums using [u,v] witha sgu<c<VS§ g 


i z sums 
and only corresponding sums using [u,c] and [c,v] are used. For 


(24) 
g(v) - g(u) = g(v) - g(c) + g(c) - glu). 


However, f(x) multiplies the differences, and the ranges ae rodeiro 
in the ignored sums from those in the accepted sums. Let oe : oe 
or g continuous, the changes of sum tend to 0 as u-v si i phie 

are discontinuous at the same point, sometimes the refinement in g 


j i le, 
exists but the Riemann-Stieltjes integral does not. For examp 


A 


aaa) pM0sx<1) 


pal 
x 
D 
1l 
m 
a 
-x 
x 
cay 
1 
p 
= 
= 
^ 
x 
WA 
Pe 
ni 


WIERE 2) 
For if 1 is not an end-point of an interval [u,v] 

- = d f(x 
of a partition of [0,2], so that u < 1 < v, then g(v) g(u) s (x) 
takes both values 0,1. For all other intervals of the partition : : 

d 1, whatever 
i is 0. Thus sums take values 0 and 1, 

difference of values of g 1S Y. “a 
the size of mesh, and the Riemann-Stieltjes integral cannot exist. 


can split the partition at 1 in the case of the refinement 
e respective values 


show the difference. 


ever, we 
integral, and for [u,1] and [1,v] we have th 


g(1)-g(u) = 1, f(x) =0, f(x){g(1)-glu)} =0, and 
g(v)-g(1) = 0, f(x){g(v)-g(1)} = 0. 


Hence the refinement integral exists and is 0. 


ivi finemen 
t trivial change renders the re 
go [0,2] does not exist, for the 


t integral] useless. 
An a 


The integral of f relative to itself over 


= 1) or 
only non-zero term is that for [1,v], and f(x) can be 0(x ) 


1(x > 1), while f(v)-f(1) = 1, whatever the size of v-1 > 0. More 
finement integral does not exist when f and g have 


pie ap t, the side lying in the 


discontinuities on the same side of the same poin 
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interval of integration. 

Again, let the rationals be put in a sequence of distinct points as 
(Rds and let F5(x) be 1 for x = Pjast and 0 otherwise. Then over 
any interval the Riemann integral of fj is 0, but the Riemann integral 
and the corresponding refinement integral of the limit function as j > œ, 
do not exist. Thus, although the refinement integral is an improvement 
of the Riemann-Stieltjes integral, the improvement is marginal and 
questions involving limits under the integral sign cannot be tackled 
easily, in the way that the Lebesgue and Radon integrals can do, and so 
we turn to another integral in the next section. 

Other problems in integration theory are connected with unbounded 
functions and unbounded ranges of integration. Neither problem can be 
tackled using the Riemann or Riemann-Stieltjes or refinement integrals 
alone, but it was early seen that the continuity of the integral could 
be used to extend the definitions to cover some of the problems. For 
example, the derivative of x? is ax"? except at x = 0, where there is no 
finite limit for the derivative. But it was thought legitimate to 
integrate ax? to x? + constant even when one end of the range of 
integration is x = 0. 

A.L. Cauchy systematised earlier work, developing what we now call 
the Cauchy limit, using the length function and not the Stieltjes 
difference of values of g. If there is a singularity of the integrand 

f at a point a, in the sense that the Riemann integral does not exist 
over [a,b] for any b > a, and if the limit of the integral over [u,b] 
exists as u + a+, we define the value of the integral to be that limit. 
Similarly if the singularity is at b. If atc € (a,b) we deal with [a,c] 
and [c,b] separately. The integral over [a,o) is similarly defined to be 
the limit of the integral over [a,b] as b > + œ; for (-«,b] we let 
a >- œ; and for (-«, +o) we take the limit as -a and b tend independently 
to to. This is different from the limit of the integral over [-b,b] as 
b tends to +o, which limit occurs in complex variable theory. If the 
limit exists with a,b independent, so does the limit with a = -b. But 
if the last exists, there may be a balance between an integral going to 
+ œ on one side andto- © on the other, which balance would be broken 
when a and b are independent, and the other integral would not exist. 
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Given these definitions, various tests were devised to ensure that 
the Cauchy limits exist. We begin with the case of a singularity at a 


in the interval [a,b]. 


Theorem 2.12: Let 


(2.26) (R) | fdx exist for each u ina < U < b. 
[u,b] 


(2.27) For fixed numbers q, M, v with O<cq<1,M>0,VE (a,b) 


| F(x) | (x-a) < M(a < x s v). 


WA 


Proof: Fora<cu<wWw<vc<b, 


If fdx - (R)| fdx| = I] fdx| < 
[u,b] [w,b] [uw] 


w 
mf M(x-a) dx = [M(x-a)179/(1-a], < M(w-a) 179/(1-q) > 0 
[u,w] 7 


as W > at. 
Hence the result. It is a rather crude test. 


Theorem 2.13: (Comparison test) Let the point function g > 0, with 


numbers M,N in 0 <M<N. 


(2.28) For a v € [a,b] let Mg(x) < |f(x)| < Ng(x) (a < xs v), with 


(2.26) for g replacing f. If (2.26) is true for f and so for 
|f|» the integrals of |f| and g converge or diverge together. 


Proof: Again we can use the general principle of convergence. 


These two tests are for absolute convergence, which gives convergence. 


For more delicate tests we turn to analogues of the Abel and Dirichlet 


tests for infinite series. 


Lemma 2.14: If (2.26) is true for f, and if g is of bounded vartation in 


[u,b], then (2.26) is true for fg replacing f. 
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Proof: By Theorem 2.3 g is Riemann-Stieltjes integrable relative to the 
integral of f over [x,b], which is continuous in x, and this Riemann- 
‘Stieltjes integral is equal to the Riemann integral of -fg, by the 
analogue of Theorem 5.7 (5.25) and since f is bounded in [u,b]. 


Iheorem 2.15: (Abel-Hardy). If the integral in (2.26) converges as 
u » at and tf g ts of bounded variation in [a,b], then the Riemann 


niogral of fg over [u,b] converges as u + a+. 


Proof: In Theorem 2.2, for F(x) the Riemann integral of f over [x,b], 
and its continuity, 


(E) 2 |AF| |Ag| se © |Ag| < eM. 
lence as in Lemma 2.14, 


(R) fadx = (RS eS a 
Pe pee ire gdF tb) =-Oegladetuyk) 


+ RSJ p Fg = g(u){F(u,b)-F(a,b)} + F(a,b)g(b) 


p (RS F(x,b)- , . 
IET (x,b)-F(a,b)}dg 


|F(x,b)-F(a,b)| <e (a <x < Xp). 
lance, the last Riemann-Stieltjes integral tends to a limit as a<u <V SX 


(Rs) | - (Rs) | [s Ies] | < e var(g;[a,b]). 


[u,b] È [v,b] [u,v] 


A slight variation on this proof gives the next theorem. 


Iheorem 2.16: (Dirichlet-Hardy) If F ts bounded as u > a+ and tf g(x) 
of bounded variation in [a,b] and tends to 0 as x + a+, then the 
Homann integral of fg over [u,b] converges as u > a+. 
These tests can easily be translated into applying when the singula- 
Pity occurs at b. 
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Turning to integrals over [a,+ œ) for fixed a, we suppose that 


(2.29) for each fixed a and each b > a, f ts integrable on [a,b] by 


Riemann's method or by using Cauchy limits on Riemam integrals. 


Theorem 2.17: For positive constants C,p,X, let | F(x) | s tx7!-P (x 2X). 


Then the integral of f exists over [a, +o). 
Proof: For F(u) the integral of f over [a,u], if a<u<v, u > X, we 
have 


|F(v)-F(a)| = Fdx | <| cx”! Pax =C(u7P-v7P)/p <cuP/p > 0 


aa [u,v] 


(u > +0), 


giving convergence, 

There are a comparison and Abel andDirichlet tests for [a,o), of 
which a statement of the Abel test is given. The others are easy to 
write by analogy with Theorems 2.13, 2.16. 


Theorem 2.18: (Abel-Hardy). If | j fdx converges and if g is of 
[a , too 


bounded variation in [a.t+m), then fgdx converges. 
J[a, +o) 


If a series converges, its mth. term tends to 0 as m > œ . But the 
function inside an integral convergent over [a,+%) need not have that 
property. For let the graph of f in [m-m 3 mm 3J be the sloping sides 
of a triangle, with f = 0 at the ends of the interval and with the 
triangle vertex at (m,m), for m = 2,3,4,..., and let f be 0 outside those 
intervals. Then the intervals are mutually disjoint. The integral of f 
over the interval with vertex (m,m) is the area m? of the triangle, and 


Z T is convergent. Thus as f is non-negative, (R)| fdx is 
m=2 [0 x) 


convergent. But f(m) = m > œ. 
However, if f = 0 


Theorem 2.19: Let f = 0 and be monotone decreasing in X 2 1. Then 
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and is monotone decreasing, things are more regular. 


m m+1 
(2,30) 2 f(j) 2 Rf f(x)dx 2 z f(j), 
j=1 Ci,m+1] j=2 
m 
(2.31) Os Sn < f(1)-f(m+1), wheres = L fG- Í f(x)dx. 
m 5 
j=1 [1,m+1] 
(2.32) Sp tends to a limit p as m > œ, where 0 < p < f(1)-lim f(x). 
X>% 
(2.33) (The integral test). If f 2 0 is monotone decreasing in x 2 1 


œ 


then © f(j) and | 
j=) [1,+=) 


f(x)dx converge or diverge together. 


Proof: If j <x < j+1 then f(j) 2 f(x) 2 f(j+1). Integrating and taking 
| = 1,2,...,m we have (2.30) and then (2.31). Further, 


-sp = f(m) - (R)| f(x)dx 2 0, 
[m+1 ,m+2] 


and (s,,) is monotone increasing, and bounded by (2.31), and so tends to 


a limit p, giving (2.32). Then (2.33) follows. 


Iheorem 2.20: (Frullani). Let | f(x)/x dx exist by some reasonable 
[A,B] 


definition, for each finite A,B in 0 < A < B. Then for0d<a<b, 


(7.34) f(bx) - f(ax) _ 
ea Te Ji bBJ fO ax- J £00) gy, 


[aA,bA] ~* 


Nua, to study the convergence of the integral over [0, +) we can 
separate the behaviour of f for large x from the behaviour of f for small 
0. 


(2.35) If there is a constant C such that 


f f(x)-C 
d 
laa à i 


viols, then as B > œ, 
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(2.36) | FO) axa c tog, 2. 
[aB,bB] * 
(2.37) In particular this occurs if 
| {f(x)-C}dx 
J [A,B] 


is bounded or tends to a limit as B + œ. 
(2.38) If f(x) + C as x >œ, (2.36) holds. 


(2.39) If there ts a constant K such that 
f OK a 
i A 


ts convergent then 


(2.40) | 
J[aA,bA] * 


(2.41) Also (2.40) occurs if f(x) + K as x + 0+. 


If (2.36) and (2.40) are true then 


f(bx)-f(ax) d 


(2.42) a - 


X 


converges to (C-K)1og (b/a). 


Note that 1/x is of bounded variation in [A,+°) but not in [0,A], so 
that the analogue of (2.37) does not occur for (2.39). 


Proof: Let the first integral in the statement be denoted by F(B)-F(A). 
Then for y = ax and z = bx, 
j f(bx)-f(ax) E l ri iza l 
[A,B] à [bA,bB] [aA aB] Y 
= F(bB) - F(bA) - F(aB) + F(aA) = {F(bB) - F(aB)} - {F(bA)-F(aA)} 
giving (2.34). For (2.35), (2.36) is obvious. Also (2.37) gives (2.35) 
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by Abel's or Dirichlet's tests, as 1/x is of bounded variation. For 
(2.38), if x is large enough, 


i” bB] ftx)-€ dx| s = bB] S dx = € Tog, (b/a), 
i B,bB] x dx = C 1og(b/a). 
a > 


(2.39), (2.40) follow (2.35, 2.36), while (2.41) follows (2.38). Then 
for (2.42) we use (2.34), (2.36), (2.40). 


Some examples involve the Young functions 9, Y. 


Nw, 2.1. Let (sa) be an arbitrary sequence of distinct numbers in (0,1), 
and let 


o(x) = x+ (si <x) z a 


“how that the inverse function y is continuous with derivative lying 
botween 0 and 1. Thus y is the integral of its derivative. 


iw, 2.2. Let x= OX Xoo X be a decimal to base 3, i.e. each 
decimal place Xn can be filled by the integers 0,1,2, but no others. 
‘uch decimals represent the closed interval [0,1]. Let C be the subset 
of all such decimals with Xn # 1 forall m. Show that in a repeated 
trisection of [0,1] the open middle interval in each trisection is 
missing from C. (C is called Cantor's ternary set.) 

For each decimal x in C let y denote the corresponding binary decimal 
OY Yge Yp (i.e. decimal to base 2) where 2Y = xm a i lie acs adhe 
Writing y = g(x) in C and then taking @ constant in each of the open 
intervals that make up [0,1]NC, such that @ is continuous in [0,1], then 
) is monotone increasing from 0 to 1. Find the inverse function y of 9. 


2.8. Let p be finite and monotone decreasing in [0,c], and Tet y be 
tho inverse function of -@. Prove that if 0 <sa<b<c, 


(R) odx - (R) dy = bg(b) - ag(a). 
hen ne $ 
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Ew. 2.4. If c> 0, r> 1, 0< rx <c, and p is finite and monotone 
increasing in [0,c], then 


o(rx) 2 a(x) + (r-1)xg(x) > r(x), (x) = (R)| o(t)dt. 
[05x] 


Ee. 2.5. For (x) = xP"! where p> 1 is fixed, show that y(y) =y'Ap-1) | 
a(x) = xP/p, and y(y) = y/q where q = 1 + 1/(p-1) = p/(p-t) and 

1/p + 1/q = 1. Thus aP/p + b°7q > ab (a >0,b20, p> 1, 1/p+t/q = 1, 
with equality if and only if aP = b9, 


cape ais babel eat (y = 0) ( 1 
Ee. 2.6 For g(x) = i v $ » p(0+) =e 
log.x +1 (x2e l e”! (y> 0) ý 

o (0 <x se!) 


Thus 
(2.43) alog.a + el > ab (a 2 at. b = 0) 


with equality for b = loga +1. If O<a< at, b 2 0, then 


As alog.a has a minimum at a = ra we have (2.43) again, with strict 
inequality. Hence (2.43) is true for a > 0, b > 0. 


Ets. ails (R) | log x dx = [xlog x-x]! = -1-ulog u+u > -1 as u > 0+. 
[u1] € e u e 


f nm 2 tolt Wt 422, 1.2 
Be. G28, we ae dx = [3x Tog .x- 7* Jy =- qo pu log u + qu seg 
as u > 0+ 


(New University of Ulster, 1978, M 112). 


Ex. 2.9. (R)| (1=x2)73 dx = arc sin u - arc sin 0 + łr (u > 1-). 
[0,u] 


[xsin(1/x) J) 


Ex. 2.10. wf pp e - (1/x)cos(1/x))dx 
4 u Fy 


= sin 1 - usin(1/u) + sin 1. For -1 <sinB< +1. 
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fw, 2.11. Consider the convergence and evaluation of 
wf xf (q-x) (x-p)} 2 dx (0 < p< q). 
[p.q 


Ihe use of x = pcos + qsin@e removes the singularity. 
ii, 2.12 Examine the convergence of 


f (1-x)4sin x dx. 
[0,1] 
lw, 2.18. For what values of the constant p are the integrals 
| x Psin x dx, | x Pecos x dx, 
[0,1] [0,1] 


convergent? Note that 
0 s xPsin xs xP (0<x¢ 1). 
(New University of Ulster 1983, M 112). 
iw, 2.14. Is the following integral convergent or divergent? 


yx.cosec x dx. 


PR 
(New University of Ulster, 1973, M 211). 


te. 2.15. I= | log,sin x dx 
[0 s3m] 


is a mos interesting integral. Over the range (0,4m) the graph of sin x 
is high „han the chord joining the end-points (0,0) and (47,1). Hence 
sin x > (2x)/q, |log,sin x| = log, cosec x < Tog, (1r/(2x)) which tends to 
t w slower than any negative power of x. Hence convergence from 


b 


x“log,sin x > 0 (x + 0+). 


lo evaluate I put x = 4m-y, and later, z = 2x, t = q-z, to give 
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; 
I= log,cos y dy, 2I = | log, (sin x cos x) dx 
[0,37] [0,37] 
«| log .(4sin 2x)d 4nlog 2 + 31 
= x = -3nlog 2 + 
[0,3] © à 
+ log sin z dz = -4n log. 2 + I, I = -4n 10g_2. 
JEn] 8 s pres 


Ex. 2.16. Show that f:[0,1] + R given by f(x) = 2n n? for 


1-n - mh ai 
1-27" sx < 1-2, and #(1) = 0, is integrable over [0,1] by a 
Cauchy limit. 


(University of Ulster, 1986, M 112). 


p-1 
Ee. 8.17. | X — dx. Prove that this is convergent if and only 
[0,+0) 1+x 

if 0 <p< 4, 
(New University of Ulster, 1975, M 211). 
Ex. 2.18. Prove that A j ee d ae 

eels, rove tha = x, B= | dx are 

[0,1] 1+x [1, +0) 1+x 


convergent with B = -A. Deduce that if a > 0 then 
| = dx = (log a)/(2a) 
[0,+0) aex? e l 
(New University of Ulster, 1978, M 211) 
Ex. 2.19. Show that the following is convergent, and evaluate it. 
oe x? 10g,X dx 
(New University of Ulster, 1977, M 112). 


Ex. 2.20. Integrating by parts, show that if p > 0 is a constant, 


x Pcos x dx and x Psin x dx 


las ee 


are convergent. Using Ex. 2.13, find the ranges of p for which the 
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corresponding integrals over [0,+%) are convergent. 


2.21. Using integration by parts prove the convergence of 


| Sin X gy 

[2, +00) Tog. 

2.22. Prove the convergence of the sequence whose mth. term is 
1. log. (m+1) 
jmo o ° 


1.23. For0<Ak<B,0<a < b rearrange the integral 


Wil, Ged. 


| cosbx-cosax gx 

[A,B] 

in terms of integrals of cos x/x over two ranges, one range depending 
only on A and another depending only on B. Using |1-cos x| < x” for 
mall x, in the first range and integrating by parts for the second 
range, show that as A + 0+ and B > + œ, the integral tends to log, (a/b). 


(New University of Ulster, 1977, M 211). 
bu. 9.24. In Ex. 2.20, when p is fixed in 0 < p < 1 the second integral 


is not absolutely convergent. For 


w] Lilas z | siny ya | siny y 
Erm] xP j=2 JEO, nJly+(j-1)n}P  j=2 JE0,n] (jn)? 
A m 
2 sH Al a as mM > œ 
P j=2 X 


)}, The Calculus Indefinite Integral And The Riemann-Complete Or 


Generalized Riemann Integral 


A still earlier definition of the integral is that of Newton. The 
ulculus indefinite integral (or Newton integral) F of the function f over 


[a,b], ts any solution of the differential equation 


(3.1) dy/dx = f(x) 


in [a,b], the derivative being finite at each point of [a,b]. Thus, given 
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€ > 0, there is a 6 = 6(x) > 0, depending on e and x, such that 


(3.2) Ebe) = Fd) - #(x)| <e, [F(xep) - F(x)-F(x)p| < e |p| 


fora <x b, a < x+p $b, 0 < |p| <6(x). 


The first point is that the Newton integral can only integrate 
derivatives, and these have one property of continuous functions. 
Darboux has shown that 
(3.3) tf F'(x) exists at all points of [a,b] with F'(b) # F'(a), 

and if z lies strictly between F'(a) and F'(b), there is an 


x ina < x <b with F'(x) =z. 
Proof: We can replace F(x) by +(F(x)-zx) and so assume that 
(3.4) F(a) < 0 = 2 < F"(b). 


Then the infimum in [a,b] is attained at some point x in [a,b] since F 

is continuous, and x cannot be a nor b, because of the inequalities in 

(3.4). Hence F'(x) = 0, proving the result. Hence 

(3.5) cf g(x) = 0(x < 0), g(x) = 1(x 2.0), then g is not a derivative 
and does not have a Newton integral, since g does not take the 
value 3. 


Continuous functions on a bounded interval or a compact set are 
bounded, but not all derivatives are bounded and Riemann integrable. 
For example, if 


(3.6) F(x) = x®sin(1/x2) (x #0), F(0) = 0, 
F'(x) = 2xsin(1/x2)~(2/x)cos(1/x2)(x # 0), F'(0) = 0, 


and F' is unbounded at 0. 

There is no standard calculus construction of the Newton integral, 
but when found, we recognise it by the defining property (3.1). Thus, 
as usual in the calculus, we have to look at the differentiation process 
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DF = dF/dx and so the operator D with the following properties. Let 
1,G be differentiable and let m,p be constants. Then 


(3.7) D(mF + pG) = mDF + pDG, 
(3.8) D(FG) = (DF)G + F(DG), 
(3.9) D(F(G)) = (dF/dG@)DG, 


(3.10) Dm=0. 


iT] 


r -1 Š 
Ihus the integral, which we can write F=D f, has the following 
properties. 


7 i -1 
(3.11) ov (me + pg) = mb's + pD`'g 


for derivatives f,g and constants m,p, 
(3.12) o”'(Fpe) = FG - D7'((DF)G), 


the formula for integration by parts, which follows from (3.8). From 
(3.9) with D, for d/dG and h = D,F, we have integration by substitution, 


(3.13) Dy7'h = D7’ (hDG). 


Ihe more general integral to be defined later, still in some sense obeys 
(3.11), (3.12), and (3.13). By (3.10), F = p-'f is indeterminate as we 
can add an arbitrary constant, but as it is constant it cancels when we 
calculate F(b) - F(a). 

(3.14) The Newton integral of a derivative f over [a,b], ts uniquely 

defined, apart from an arbitrary constant. 

Proof: The derivative f is given finite for each x in [a,b], so that if 
DF = f = DG then D(F-G) = f-f = 0, for each x in [a,b]. By the mean value 
theorem, F-G is constant and so F(b) - F(a) = G(b) - G(a). 
(3.15) . If f ts Newton integrable to F over [a,b], and Riemann integrable 


to I then I = F(b) - F(a). 
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Proof: By D'F = f and the mean value theorem for each interval of the 
partition (1.1) we have a Riemann sum s, using suitable Xs 


F(b)-F(a) = 


m 
J= 


m 
AE = A f(x;)(uj-u;_,). 
As the mesh of (1.1) tends to 0, the last sum tends to I, while the sum 
remains equal to F(b)-F(a). Hence the result. 

From (3.2) we can construct F from the derivative f if we can find 
a partition (1.1) in which each interval Luju; is associated with a 
point x. in the interval with uju; < 5(x5), 6 being the function of 
(3.2) that is defined over [a,b]. For 


( Stats Sie as 
ss ie aig S, 


|F(u.) - F(x;) š F(x5)(u5-x,)| s elu; =X); 


J 


[F(x;) = Flus) = f(x) (xjzu; g) $ e(xj-u; 
[F(u;) - Flujq) - F(x;)(uj-u5_ 4) < e(u.-u. pe 
It follows that 


(3.16) |F(b) - F(a) - s| < e(b-a) 


where s is as in (1.3), the special Riemann sum for the division E. If 


also Xj = uj or Xj = Us_ys for j = 1,2,...,m, the author has said that E 


is compatible with 6, or that E is §-fine. The latter is shorter and will 


be used, it was first given by McShane, though McShane drops the 
restriction on x. and only assumesthat x. € [a,b]. The function § > 0 
depends on g > 0, and as ¢ + 0+ we would have to let §(x) + 0+ at each 
x € [a,b], and then by (3.16), s tends to F(b)-F(a). This is the one- 
dimensional case of the kind of limit we use in this book, and in 
lectures the author has said at least from 1962 onwards that the limit 
occurs ‘as § shrinks’. 
If in the division for (1.1), ujq < Xj < uj for some j, we can 

replace Lu; 42u] by Lu; 4X5] and [x;>u;], so that Xj would be at one 
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end of each interval. This will not change the sum s since 


F(xj) (uju; 4) = F(x5)(U5-x 5) + F(x5)(x;-u5_4). 


It will turn out later that divisions E with U5 4 
(j = 1,...,m) have advantages and better properties than those in which 


=X, or U, = X, 
J J J 


the only restriction is U5 < Xj < Uj» while we have the same 
collection of values s. 

Using -fineness we can now give a formal definition of the Riemann- 
complete or generalized Riemann integral over the interval from a to b 
of a function h([u,v),x) of interval-point pairs (Lu,v),x). First we 
use intervals [u,v) closed on the left and open on the right, for 
simplicity of description. If u <v <w then [u,v) and [v,w) are 
disjoint with union [u,w), of the same type. Using closed intervals in 
this example, the intervals would have v in common, and we would have 
to say that the intervals are non-overlapping. Next, the generalized 
kicmann integral is a number H with the property that for all 6-fine 
divisions E of [a,b), 


(3.17) |(E) = h(Lu,v),x) - H| <€, 


where the positive function 6 depends on the arbitrarily small e > 0. 
We use the notation 


H = | dh, or H = | fdx 
[a,b) [a,b) 


if h([u,v),x) = f(x)(v-u). 


One essential requirement is given by the following theorem. 
Iheorem 3.1: Given §(x) > 0 in [a,b], there is a $-fine division of [a,b]. 


Proof: To each x € [a,b] there corresponds a symmetrical open neighbour- 


hood 
I(x) = (x-6(x), x + 8(x)). 


ly Borel's covering theorem, as a,b are finite a finite number 
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I(x, ) (k = 1,...,m) covers [a,b], i.e. every point of [a,b] lies in at 
least one of the I(x) ). 


(3.18) We ean arrange that each point of [a,b] lies in at most two 


of the I(x,). 


For let I(u), I(v), I(w) have a common point andasu<vewsb. 
If 


v-§(v) su-s(u) then g(v) = g(u)+v-u > g(u), 
v+s(v) > v + (u) > utg(u), I(u) c I(v), 


and we can omit I(u). Similarly, if v + (v) 2 w + 6(w) then I(w) c I(v) 


and we can omit I(w). Otherwise, as all three intervals have a common 
point, 


u - 6(u) < v-é(v) < v + 6(v) < w+ (w) implies I(v) c I(u) u Iw), 


and we can omit I(v). Successive applications of these results give 
e. < Xn $ b with 


(3.18), from which we can assume that a < X4 < Xp < 
€ I(x). The inter- 


consecutive I(x) (k = 1,...,m) anda € I(x,)5 b 
section 


(X55 44) n I(x;) n Ixia) 


is not empty and so is an open interval, in which we choose a point Ys 
Thus Xj < Yj < Kaye We can construct a partition (1.1) with Ug = â, 
Uy = xy (if x, > a), or otherwise Uy = Y4> while Uy is Y4 OF Xys 
respectively, and so on, so that a, b, Xn? Xj» Y (j = 1,...,m-1) are 
the division points. By construction the division is §-fine and the 


theorem is proved. 
Theorem 3.2: The integral of h over [a,b) ts uniquely defined. 


Proof: Let two numbers Has Hy have the property of H. Then, given e > 0, 
there are functions 5, (x) > 0 such that 
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(3.19) |Œ) z h([u,v);x) - Hg] ze (Rk = 152); 


For &(x) = min (8,(x), S5(x)), then 6(x) > 0, and by Theorem 3.1 there 
is a 6-fine division E of [a,b). This division is 6,-fine and 6-fine 
and so is an E} and an Ens and from (3.19) 


JH, - Ho] = |{(E) Zh - Ho} - {(E)Eh- H}| < 2e. 


Thus Hy = Ho as they are independent of £ > 0. 


By (3.16) the Newton integral is a generalized Riemann integral; and 
so is the Riemann integral, which can use constant functions 6. In 
general the (x) can vary from point to point and, for example, could 
tend to 0 as x tends to some fixed number w in [a,b] even though 6(w) > 0. 
A typical construction is as follows: 


(3.20) For all x € [a,b] with x # w let 0 < 8&(x) < |x-wl. 


By Theorem 3.1 there is a 6-fine division of [a,b) of the form (1.1) with 
X55. If Xj = uj # w then 5(x;) < lu;-wl and neither Cu5_4 451 nor 

Cu; Us ,4] can contain w. Thus w is only contained in the closure of one 
or two intervals of the division with end-point w, and w has to be one 

of the points of division, with some Xp = Ws and [uy 42) > [Wu 4) are 
intervals of the division, unless w is a or b, when one interval is 
missing. We can arrange a similar construction for a finite number of w, 
including a or b or both, or we could use a sequence of points w. See 
Ex. 4.1 for the n-dimensional case. 

Another construction shows that sometimes the Lebesgue integral is 
also a generalized Riemann integral. Let (X) be a sequence of mutually 
disjoint sets of real numbers, each set being of "measure zero", i.e. for 
each k and each € > 0 there is a countable union G of disjoint open 
intervals I with G > Xk and mG < e, where mG is the sum of lengths of the 
I of G. If X is the union of sets Xe and if a function f is such that 


IFO] 3 2 (EX. k=1,2,...), f(x) = 0 (x EX”), 


k? 


we call f a null function. 
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Theorem 3.3: The generalized Riemann integral of a null funetion over Theorem 3.4: Let f be a finite real valued function, Lebesgue-integrable 


[a,b) zs zero. over a measurable set M of finite measure, with Lebesgue integral 


Proof: Given € > 0, we choose Gk and §(x) > 0, such that (L) i f(x)dx =H 
JM 


Gk 2 Xe mG, < qk. S(x) < 1 (x EXX), (x-8(x), x + 6(x)) €G Then, given ¢ > 0 and x E M, there is an open set G(x) containing x, such 


k 
that whenever 


(X © Xis K= T Brao 


k? (3.21) Ty sIgs--- are disjoint measurable subsets of M with 
For a 6-fine division E over [a,b), the only nonzero f(x) have x € X, so - 
that m(MN u I.) = 0 
= "A jot J 
k -k _ 
I(E) z f(x)(v-u) - 0| < 2 2 me, < = €.2 "= €. and X15X55+.. are points satisfying x. € 1. c G(x.) (j = 1,2,...), then 
k=1 k=1 12 J J= J 
Hence the integral is zero. | z f(x,)m(1.) - H| < e- 
A typical null function is the indicator of the rationals, the j=1 J 


function that is 1 at each rational and 0 at each irrational. For we Here the measure of the set X is m(X). 
can write the rationals p/q as a sequence 
Proof: By the absolute continuity of the Lebesgue integral there is an 


B/N, WN, “AN, Bi y “2/1, 2, “1/2, 3/14 ~3/15 2/2, -2/2, n > 0 such that for measurable sets X c M, m(X) < yn implies 


1735 4735 Alia sn 
(L) Í |f]dx < 6/3. 
x 


in which |p| + q takes the successive values 1 once, 2 twice, 3 four times, 
4 six times,... while |p| goes from its maximum down to 1 in each group For g = e/(3n + 3m(M)) and k = 0,41, + 2,... let 
with constant |p| + q, taking q > 0. Removing the second and later 

appearances of each rational, we have a sequence (rg) that takes each 


HA 


X = {x : (k-1)g < f(x) < kg}. 


rational once and once only. The intervals Choose for each k an open set Gk > A such that 


ET T ET (3.22) maX) < tllt, 
form a set G with mG < £, that encloses the sequence (rg) of rationals. and take G(x) = G whenever x € X,. If (3.22) is satisfied, with X) EMRE gy? 
As e > 0 is arbitrary, the indicator of the rationals is a nuJ1 function. then 
The null functions are typical Lebesgue-integrable functions. be TR I Xk js S kg)? |z f(x;)m(1,) - H| 
Assuming a little of Lebesgue theory we can prove that the Lebesgue J= RS a Rd J J js J 
integral is included in the generalized Riemann integral. -|5 wf {F(x.)-F(x) }dx| <5 wf |f(x,)-f(x) |dx 
it, t ji o Y 
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<r (L) Jo 
< m [ Fx 5) -F 0x) [dx 


+ wf [f(x)dx + z wf |#(x)|dx = R+S +T, 
say. We show that each sum is less than ¢/3. If x € I, n X) f(x) 
lies in the same range ((k(j)-1)g, k(j)E] as F(x5), so that they can only 
differ by + =, and 


Rs = wf č dxsg z mI.) = gm(M) < ¢/3. 
j=1 Jin ae COC 

J KREG) 
In S we collect those terms (if any) for which k(j) has a given value k. 
Then 


| (x5) [dx se 2 Mh ai 


f(x;)| < (]k]+1)g, S= z 
| Fix5)] < Ck]+1)g í EN 


(s) 


N X) 4 i (|k]+1)Em(G >X) < ¢/3. 


Finally, for Y the disjoint union of sets ISl) 


mY) = 5 zo mIaX)s £ MGN) <n. 
kero klijk JK” p KK 8 


The proof is complete by definition of n, since 
f 
T= (L)| |f(x)]| dx < ¢/3. 


Thus the generalized Riemann integral includes the Lebesgue integral, 
by taking in (3.21) a finite number of disjoint intervals forming a 
partition of an interval, together with suitable points Xj forming a 
division of the interval. However, the generalized Riemann integral is 
wider as it also includes the Newton integral, and, for example, |F'(x) | 
from (3.6) does not have a generalized Riemann integral, since 


(2/|x]) |cos(1/x°) | 
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is not generaliz ed Riemann integrable, as can be proved. Actually it 
can be proved that the generalized Riemann integral is equivalent to the 
Perron and special Denjoy integrals when integrating f(x)(v-u), and it 
can integrate interval-point functions. 

A limitation of the generalized Riemann integral that is shared by 
the Riemann-Stieltjes integral and the integral by refinement of partitions, 
is brought out by the following theorem. 


Theorem 3.5: For X a set everywhere dense in [a,b] if f, g, g, are point 
functions in [a,b] with g = 94 in X, and if the generalized Riemann 


integrals in (3.23) exist, then 
(3.23) f fdg- f fag, = Flb)(9(b)-9,(6)) = Fla)(a(a)-g,(a)). 
[a,b) [a,b) 


Tf, for two different constants G> Gy» g = G; in a set everywhere dense 
in [a,b] and g = G, in another set everywhere dense in [a,b], the only 
function f that is generalized Riemann integrable relative to g in [a,b], 
is a constant function. 

Clearly, if we wish to integrate more than constant functions 
relative to g, then g cannot oscillate too much, or else we would have 
to use a convergence-factor integral such as the Gesaro - Perron integral 
of J.C. Burkill, modified to be of Stieltjes form. 


Proof: In Theorem 3.1 let the points yj (which lie in certain intervals) 
be taken from X, for j = 1,...,m-1. Then for such divisions the sums of 


F(x5)(g(x5)-g4 (x5)-o(y 5_4) +94 (¥5_4)) + F(x; gly; )-g4 (y5)-a (x5 494 0x5)) 


= F(x, )(glyj)-94(yg)-oly5_4)+94(y5_4)) = 0. 


The only terms that do not cance] are the ones for a and b, and the 
generalized Riemann integral of f relative to 9-9, is equal to the right 
of (3.23). If for the two different constants G, and Go we can replace 
94 by G, and by Go» the two given values of the same integral must be the 
same, which gives f(b) = f(a). It can be proved (see Theorem 5.1) that 
the integral also exists for all [u,v] c [a,b], so that f(u) = f(v). 
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Hence f has to be constant if the generalized Riemann integral exists. 


Ex. 3.1. Let f(x) = 1-g(x) = i (x < 0) . Using (3.20) for w = 0, prove 
0 (x > 0) 
that the following integrals exist. 
| fdg = 1, | gdf = 0. 
[-1,1) [-1.1) 


As f and g have discontinuities at x = 0 on the same side, the refinement 
integral over [-1,1] does not exist. Similarly the generalized Riemann 
integral of f relative to g exists when f and g are both of bounded 
variation, though the refinement integral does not always exist. 


Ex. 3.2. Let f(0) = 1, f(x) = 0(x > 0), and for an interval-point 
function h(I,x) in [0,1] let the generalized Riemann integral of fh 
over [0,1) exist with value K. Then 


lim h{0,u);0) = K. 
u+0+ 


(Given e > 0, there is a positive function § on [0,1] such that every 
6-fine division (E) of [0,1) satisfies 


I(E) E fh - K| <e. 


Let 0 < u < 6(0). Then there is a 6-fine division of [0,1) that uses 
(C0,u),0) and so 


|h[0,u);0) - K] <e. 


Hence the result.) 
More examples are given at the end of Section 4. 
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CHAPTER 2 


SIMPLE PROPERTIES OF THE GENERALIZED RIEMANN INTEGRAL IN FINITE 
DIMENSIONAL EUCLIDEAN SPACE 


4 Integration Over A Fixed Elementary Set 


We begin with mutually perpendicular co-ordinate axes 0X4 +++ 50x, in 
n-dimensional Euclidean space with the Pythagorean distance from 


x (xy 20+ oxy) toys VEREAP 


a 2 
lx- yll = viz (x; - y.)%. 


a= (agase ah) and b = (bjs... bp) with aj < b5 (3 = 1,...,n), define a 
brick I, written [a,b), which is the set of all points x = (Xjes 2X) with 
ay < xX. < b.(j = 1,...,n). Chapter 1, Section 3, takes n = 1 and bricks 
[a,b) with a < b, b not being attained by points in the brick in order 

that for a < b < c, [a,b) and [b,c) are disjoint with union another brick 
[a,c). Similarly we do not let x. attain b (J = 1,...,n) when n > 1, while 
x remains in the brick. The closure IĈ of I is the set of all x satisfying 
aj < Xj < bj (j = 1,...,n), written [a,b], and the sets of all x € 1° with 
X. = a., and those with Xj = bj» for a single j in 1,...,n, are called faces 
of I. The 2" special points x with Xj = aj or Xj = b; for each j = 1,...,n, 
are the vertices of I. If Yj has the other choice of aj or bj after Xj is 
chosen, then y = (Ygs. Yp) is called the opposite vertex to x, and 

|x - y|| is the diameter of I, diam (1). 

A finite union E of mutually disjoint bricks I is called an elementary 
set, and its closure E° is the union of the corresponding I°. An open sphere 
S(x,r), centre x, radius r > 0, is the set of all y with ||x - y||< r. Leta 
positive function (x) be defined for all x € EC. Then a brick-point pair 
(I,x) is -fine if I c S(x,6(x)) with x a vertex of I. A partition P of E 
is a finite collection of mutually disjoint bricks I with union E. A 
division D of E is a finite collection of brick-point pairs (I,x), the bricks 
I being mutually disjoint with union E, and so forming a partition of E 
that, we say, comes from D. Further, Dis -fine if each (I,x) € D is 6-fine. 

Let h(I,x) be a finite-valued function of brick-point pairs (1,x) with 
ITcEandxe EC. We then say that h ts defined for ES. 


h(1,x) = f(x)v(I) where, for I = [a,b), 


For example, 
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the volume of I. Then the generalized Riemann integral of h over E is a 
number H with the following property. Given e > 0, there is a positive 
function 6 on ES such that 


(4.1) |[(D) ch(I,x) - H| < € 


for all -fine divisions D of E. Here, (D) xh(I,x) is the sum of h(1,x) 
for all (I,x) € D. This definition (4.1) can apply to any space K of values 
of h that has an addition (or multiplication using m instead of £) together 
with a metric or topology to replace the modulus in (4.1). We use the 
notation 


H = | dh, or H = | fdk if h(1,x) = f(x)k(1,x) or f(x)k(1). 
E E 
If h and |h| are (generalized Riemann) integrable over E, we say that H zs 
an absolute integral. If h is integrable but not |h|, we say that H zs a 
non-absolute integral. We sometimes use the set S(h,63E) of values (D) Zh 
for all 6-fine divisions D of E. 
We need the following theorem to make the definitions viable. 


Theorem 4.1: Given a positive function § on fs there is a -fine division 
of E. 


Proof: Suppose the theorem false. Then as E is a finite union of mutually 
disjoint bricks I, if a -fine division of each such I exists, the union of 
the separate divisions is a §-fine division of E. Thus the theorem is 
false for a brick I, z.e. there is no §-fine division of I. We now use 
repeated bisection, we bisect I in the Xj direction by the hyperplane 

Xj = ia, + b.), with Xk arbitrary for all k # j, and repeat this for 

j = 1,...,n, obtaining 20 smaller bricks J. By the previous argument from 
E to I, one of the J has no 6-fine division. Repeating this construction, 


we have a sequence 
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1 2 3 


E Ee a Biss 

of bricks with no 6-fine division. For (xd) the sequence of centres of 

the bricks, xe Dk 2 j), |x“ - x || < diam(1) = 217} diam(1')+ 0 (j + @), 
and (x2) is a fundamental and so convergent sequence, say with limit x. 

S(x) > 0 exists as xk € J cIcE(k2j),x€ E°; and x € PS diam(IY) > 0, 
J c S(x,6(x)), eventually. For m= 1,2,...,n the hyperplane loci of y 

given by 


(Yn = Xn? Y, arbitrary for k # m) 
cut up IJ into at most 2” smaller bricks J each with vertex x such that 
(J,x) is 6-fine. Thus there is a 6-fine division of 1, contrary to its 
definition. This contradiction proves the theorem. When n = 1, Theorem 3.1 
gives another proof. 


Theorem 4.2: The integral is uniquely defined. 


Proof: Let h(I,x) be integrable to H} and to Ho both over the same E. Then 
there are positive functions op on ES and depending on e > 0, such that for 
5,7 Fine divisions Des 


(4.2) | (D,) rh(I,x) - Hy | < elk = 132); 
Then (x) = min(S, (x), 55(x)) > 0 (x € EĈ) and by Theorem 4.1, there is a 
s-fine division D of E. As D is also ĉ-fine and do-fine, the spheres being 
the same or larger, (4.2) is true for D = D} = Dy. Hence 

[H, - Hal = [{(0) Zh - Ho} - {(0) oh - Hy} < 2e, 
while Hy Hy are independent of s > 0. Hence Hy = Hos giving the uniqueness. 
Theorem 4.3: Let the function h of brick-point pairs, defined for E, with 


real or complex values, have the property that given e > 0, there is a positive 


funetton § on E° such that for each pair D, D' of S-fine divisions of E, 
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(4.3) (D) eh(1,x) - (0') ch(I,x)| < e- (4.5) If also h; h, are real-valued with h, 2 h, for all relevant 


v 


(I,x), then H H 


ree 


(We then say that (D) xh ts fundamental (E).). Then h is integrable over E. 


Proof: Given g > 0, there is a positive function ŝj on E? such that 
Proof: Let 8; be a positive function on EC such that e = 1/j in (4.3). As 
= 2 
(4.6) |(0;) Zh (14x) Hj| < že 


.(x)) > 0, 


min(54(x) s69(X) 54-4385 


for all 6;-fine divisions D; of E (j = 1,2). Taking (x) = min(S4(x)s 
it can replace 65 (x), and so we can suppose that at each x € ES, 5o(x)) > 0, a -fine division D exists (Theorem 4.1), and D is 5;-fine 
(j = 1,2). Hence 


64(x) 2 S9(K) Be 2 Slr) È wes 
|(D) &(hythy) - (HytHy)| = [{(D) Zhy-Hy} + {(D) Ehy-Hy}| < £, 


Then each Sp fine division of E is 6;-Fine for all j < k. For each j choose 


a 6;-fine division D; of E, and so a sum |(D) sch, - cH, | = |c] | (P) Zh, - H| s 2[c| e+ 0 

Sj = (0;) sh(I,x). Is; - sql < 1/j (k S3) with e. Hence the results in (4.4). For the third result, 
from (4.3) as Dg is 5;-fine (k 2 j). Hence (s;) is a fundamental and so H, + Be > (D) h, 2 (D) Eh, > Hy ~ de, Hy - Ho > ~-e. 
convergent sequence, say with limit s. By (4.3) again, with sj-fine divisions 
D of E; As Hy - Ho does not depend on e > 0, it follows that H, - Ho 2 0. 

In (4.4), hy and hy can be complex-valued with c complex, and for 

I(D) zh - s| < 1/3 (all k2 j), |(D) rh - s| < 1/3, such functions there is a partial converse to the first part of (4.4). 
and we can take j = 1/2,... Hence by definition the integral of h exists Theorem 4.5: Let Ky. ky be real-valued brick-point functions defined for 
over E with value s. an elementary set E, with k = Ky + ikg. If the complex-valued k is integrable 

A similar result holds if the space of values of h is a more general over E to K then kj is integrable over E to K5(3 = 1,2) where Kjo K, are the 

complete space with addition and a suitable topology. real and imaginary parts of K. 
Theorem 4.4: If hy and hy are two brick-point funetions defined for E, Proof: First, if a,b are real then 
that are respectively integrable to Hy, Ho over E, and tf c ts a constant, 
then Ja + ib| = (a + b°) z max(la|, |b|). 


+ hy and ch, are respectively integrable to H, + Ho and cH}. Hence, given s > 0, there is a positive function 6 on EC such that for all 
‘-fine divisions D of E, 


(4.4) hy 
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et a a —- = - 


e> |(D)z(k;+iko)-(K}+iK3)| = |{(P)zk;-K}} + i { (D)Zko-Ko}| 2 [(P)zk;+K; | 


(j = 1,2). Being true for each e > 0, by definition Kj is the integral of 


k5 (3 = 452) 


We now have a geometrical result vital to some parts of the theory. 


Theorem 4.6: (4.7) For two disjoint elementary sets Ej» E, there is a 


positive function 6, on E, U a such that if (I,x) is Sp-fine then 


Te E,or Ie Ez. 
(4.8) Let bricks I,,...,1,, form a partition P of an elementary set E. 
Then there ts a positive function Sp on E? such that every Sp-fine 


division D of E is a refinement of P, i.e. tf (1,x) ED then I = I, 


HA 


for some j in 1,...,m. We write D < P. 


Proof: Let E,° be the interior of Ej and A the boundary or frontier 


ES XN Ee of Eje As E; is a finite union of bricks, the geometry shows that 


Ej is a finite union of parts of faces of the bricks, the parts being (n-1)- 
dimensional bricks. If x € Pi U Es let 25p(x) be the distance from x to 
p” U Ey If x € F U Be let 25p(x) be the minimum distance from x to 
the parts of faces in Ni U ES” on which x does not lie. Then Sp(x) siu TF 
(rx) is Sp fine and I. has no points in common with A U EP then x € Eee 


Ic Ey, or x € E,?, Ic E). If (1.x) is &-fine and I° n (EP u EP) 


is not empty, then by construction x € g’ U Ai 


any part of a face in BP U E on which x does not lie. If x lies on face 
F, then as x is a vertex of I, no interior point of I lies on F. As 

Ic Ey U E, then I cannot have interior points in E} and other interior 
points in Eos or else a face would have interior points of I. Hence either 
T&E 


and IČ does not intersect 


į or Ic E, proving (4.7). 


To prove (4.8) let 8; be the 6, of (4.7) for which E = I, and E, = EI. 


Then min (54 5+++ 25) > 0 is the Sp for (4.8). 
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Corollary: Every refinement or o-integral in R" using bricks, ts a 


generalized Riemann integral. 


In many parts of analysis, monotonicity plus boundedness often leads 
to convergence. Here is an example. First, a real-valued function h(I) 
of the bricks I c E, is finitely sub-additive if, for each I c E and each 


partition P of I, 


If the inequality is always reversed, h is finitely superadditive. 


Theorem 4.7: If the real-valued h is a finitely subadditive brick funetion 
in E and if, for some positive function ô on E° and alt $-fine divisions D 


of E, the set of sums 
s = (0) zh(I) 


is bounded above with supremum H, then h is integrable over E to the value H. 
If, on the other hand, the set of sums s is unbounded, then for each 
integer m there ts a positive function on on E? such that s 2m for all sums 


s of h over Oy fine divisions of E. 


Proof: Given a division D over E, and so a partition P and a sums, by 
Theorem 4.6 (4.8) there is a positive function 5p on EC such that all 
Sp-fine divisions D' of E satisfy D' < P. If sums s are bounded above by a 
supremum H then, given e > 0, there is an s lying in H-e < s <H. Ash is 
finitely sub-additive, 


H-e < (P) ch(I) < (D') ch(1) <H, 


and by definition h is integrable to H. On the other hand, if sums are 
unbounded above, for each integer m there is an s z m, froma partition P 
of E. Thus 


mss = (P) Zh < (D') rh(1) 


for all Sp-fine divisions D', finishing the proof. 
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The only difference between Riemann and generalized Riemann integration 
lies in the fact that the first uses a constant mesh over the whole of the 
elementary set E on which the integral is defined, whereas the second uses 
a variable mesh controlled by a positive function §. It is therefore useful 
to look at a few examples of such positive functions. Two are given in 
Section 3, and here we generalize them to R’, 


Es. 4.1. For w€ E° suppose that s(x) < |x-w| when x # w. Then a §-fine 
(1,x) can only have w € IC when x =w. Thus we force w to be a vertex of 
at least one brick from each §-fine division of E. We can apply this 
construction to a finite number Wy Wo sees oWs of points of EC to obtain a 
function 8; > 0, or to a sequence (ws), that could be dense in ES; by using 
the corresponding sequence (85). 


Ex. 4.2. For Gan open set in R” and each x € G, we can arrange that the 
sphere 


S(x,6(x)) c G. 


m 


Ex. 4.3. If S(j = 1,...,m) are positive functions on ES, so is min(S,5+++38 JA 


The same cannot be said of an infinite sequence of positive functions 
unless the sequence has special properties. For if 5; (x) = 1/j(x € ES), the 
minimum is 0. 


Ex. 4.4. If a positive function 6 (x)(x € ES) is given for each y € ES, 
6(x) = 5, (x) is also a positive function. This is like a Cantor diagonal 
process. 


Exc. 4.5. In one dimension let PO.) be the families of all [u,v) (respec- 
tively, all ({u,v),u) and all ([u,v),v)) that satisfy 


O<u<vsi, u(m+1)/m<vsum/(m-1) (m22) . 


If 0 < 6(x) 
v-u < u/(m+1 


x/(m+1) (x > 0) then no member of Da can be 6-fine. For if 


< 
) then v < u(m+2)/(m+1) < u(m+1)/m. If v-u < v/(m+1) then 


vm/(m+1) < u. 
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Ec. 4.6. Let P and D be the respective unions of the Pii and Da’ for 
m= 2,3,..., in Ex. 4.5. Then P is the set of [u,v) with u < v < 2u, 
0<u<v <1, and D includes the end-points u,v as well. 


Ex. 4.7. If the integrable hal TX) > 0 for all é-fine (I,x) in E° and 
some 6(x) > 0, with 


h(I,x) = © hi(I.x), 
m=1 
finite and integrable, then 


(4.9) | dh = | dh: 
E m-1 E 


For in Theorem 4.4 (4.5) and each integer m, we use 


Ex. 4.8. There may be inequality in (4.9). For in Bx. 4.5 let 


v-u ([u,v) € Pas m> 1) v-u (u < v < 2u) 
halus) x) = { h(Cu,v),x) = 
0 (otherwise) 0 (otherwise) 


dh = 0, l 
lia m [0,1) 


Q 
— 
I 
> 


Be. 4.9. In Ee. 4.8, if 


then k is not integrable in any interval of (0,1). 

For, given a positive function 6 on [0,1], let S be the non-empty set 
of all x in [0,1] such that either x = 1 or the supremum of sums of the k 
over -fine divisions of [x,1) is 1-x. If s is the infimum of S then 
0<s<tandsesS. If s > 0, there is an integer m such that 
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(4.10) ({(2m-1)s/(2m) ,s)s) E Do 


and is 6-fine. Thus (2m-1)s/(2m) € S, contrary to the definition of s. 

Hence s = 0 and the supremum, of sums of k over -fine divisions of 

[x y) c (0,1), is y-x. Similarly, replacing 2m by 2m+1 in (4.10), we . 
prove that the infimum of sums of the k over -fine divisions of [x,y), is 0. 
Thus the generalized Riemann integral cannot exist over [x.y)- 


Ex. 4.10. In R? let f be a point function and h 2 0 a brick-point function, 


both on Eĉ. If Xe RI, if f=0in XX (so that f = fx(X3-)) and if fh and 


x(X;+)h are integrable over E then 
m | x(X;.)dh < fdh <M | x(X;.)dh 
E E E 
where m = inf f, M = sup f, on X. For on X, 


mh < fh < Mh. 


we could be dealing with infinite values, of m or M or both. 


ave value when at least one of m,M is finite, and 
tegral 


(Exceptionally, 


The inequalities only h i 
then it can be proved that the integral of fh is equal to a Lebesgue in 


over X n E of f with respect to the measure given for varying bricks I c E by 


$ x(X;.)dh. 


fo . . 
Ex. 4.11. If in Bx. 4.10 f is continuous on X = E”, where E is a brick, 


there is a point & € EC with 


f fdh = f(E) i dh. 


For f takes all values from m to M, and in particular the value 


| fdh / | dh. 
E E 
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5 Integration And Variation Over More Than One Elementary Set 


A real or complex valued function h of elementary sets (possibly in a 
fixed elementary set) ts finitely additive if for each pair Eys Eo of 
disjoint elementary sets, so that E, U ED is also an elementary set, we 
have 


(5.1) h(E}) + h(E) = h(E, U E2). 

Such functions are most useful, in that a sum of them over partitions of an 
elementary set E, is equal to h(E). If h is real-valued and if instead of 
(5.1) we always have 


(5.2) h(E,) + h(E,) < h(E, U E2), 


we say that h ts finitely superadditive. On the other hand, if the 
inequality always goes the other way, 


(5.3) h(E, ) + h(E) 2 h(E, U Eo). 
we say that h is finitely subadditive. In the case (5.2), refinement of 
partitions implies that the sums over partitions fall, while in case (5.3) 
the sums rise. 

We say that Ey ts a partial set of E if both E; and E are elementary sets 
and Ey cE. If also Ey # E we say that E] is a proper partial set of E. 


Theorem 5.1: If a funetion h(1,x) of brick-point pairs is integrable over an 
elementary set E to H(E), then h is integrable over every partial set E] of 

E, say to H(E;), and H is finitely additive over the partial sets E}. 

Proof: Let E} be a proper partial set of E. Then by the geometry, ENE, is 


also a partial set. Given s > 0, if the positive §, defined on BF, is such 
that 


(5.4) |(D) sh(I,x) - H(E)| < € 
for all -fine divisions D of E, then by Theorem 4.1 there are -fine divisions 
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D, of Ey and D3 


a second §-fine division D; of E} with (5.4), if 


of ENE, » and D, U D, is a -fine division of E. Also using 


Sis =(D;)zh (=1..2.3)', [S4753] 


j | (Sy+Sy-H(E))-(S3+Sy-H(E))| < 2e, (D4) Zh 


is fundamental (E,), and by Theorem 4.3 the integral over E, exists, Say as 
H(E,). Letting S3 > H(E,), we have 


(5.5) [8 - H(E,)| < 2e. 


With the same 6, e, this is true for every partial set E; of E. If Eqs Eg 
are disjoint partial sets of E, so that Ey U Eg is a partial set, and if 

Sj is a sum over a §-fine division of EG = 4,5) then S4+S5 is a sum over 

a -fine division of Ey U ES» and (5.5) is true for each of the three partial 
sets. Hence 


|H(Eq UE, )-H(Eq)-H(E,)| = [{H(E, UE,)-(sy+s¢)} + {sq-H(E,)} +{sp-H(E,)}| <6e. 


But the values of the integrals are independent of e > 0, so that 


H(E, UE ) - H(E 


q V Es) - HCE 


4 5) 


and H is finitely additive over partial sets of E. 
Clearly this theorem is vital, and it is directed inwards from E to its 
partial sets. We now go outwards, to unions of elementary sets. 


Theorem 5.2: If Ejo Eo are disjoint elementary sets with a brick-potnt 
function h integrable over E, and over Eos then h ts integrable over E} U Eo. 


Proof: Given g > 0, there is a positive function j on R such that for 
all ôj-fine divisions 2; of Ejs 


|(0;) rh - HCE.) | < Red = 1,2). 


Let 6, be the positive function of Theorem 4.6 (4.7) and put (x) equal to 
; Cee ni ‘ Cet 
min(Sp (6) 54(x)) in E} NED min(Sp(x), 89(x)) in E, NE, 5 and 
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min(6p(x), 8, (x), S5(x)) in Ei OES” È EP n pr as the interiors are 


disjoint. This defines § > 0 in E,° U Ef, and if D is a -fine division 
of E, U Es, D splits up into a 6;-fine division 0, of E} (j = 1,2). Thus 


|(D)zh - H(E,) -H(E,)| = [(0,)zh-H(E,) + (0,)zh -H(E,)| <e. 
proving the result. 


Theorem 5.3: Given s > 0, let (5.4) hold for all 8-fine divisions D of E, 


where h ts real or complex valued. Then 
(5.6) (D) x{h(I,x) - H(I)| < 8e 


for all -fine divisions D of E. Conversely, let h(1,x) and H(E,) be given 
with H finitely additive over partial sets Ey of E. If, given e > 0, (5.6) 
ts true for some positive function § on EC , then h ts integrable to H on all 


partial sets of E. 


Proof: Theorem 5.1 gives (5.5) and the finite additivity of the integral H. 
Thus if D, is a 6-fine division of a partial set of E then 


|(D,) Z{h(I,x) - H()}] < 2e. 


Let D be a -fine division of E and let D, be that part of D with real(h-H) 2 0. 
Then (D,) z|real(h-H)| = real (D,) z(h-H) < |(0,) z (h-H)| < 2e, 


(D,) Z |real(h-H)| = (DD,) Zreal(H-h) = real (DXD,) z (H-h) 
< |(DND,) 2 (h-H)| <s 2e, (D) z |real(h-H)| < 4e. 
Similarly 
(D) 2 |imag(h-H)| < 4e, and so (D) z |h-H| < 8e. 


Conversely, if z 7 E let D,> dD, be -fine divisions of Ey and ENE, + Then 
D, U D, is a -fine division of E. By finite additivity of H, h is 
integrable to H(E,) over E} since 
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| (D,)zh -H(P)| = | (0) Z {h(1,x)-H(1)}| < (D, )Z|h-H| < (D,UD,) Z |h-H| < 8e. 


A function H of partial sets of E, is the variational integral of a 
brick-point function h in E, if H is finitely additive over partial sets 
of E and if, given e > 0, there is a positive function 6 on EC such that 
(5.6) is true for all 6-fine divisions D of E. In this notation Theorem 
5.3 says that the variational and generalized Riemann integrals are 
equivalent. 

A second definition of the variational integral is as follows. H is 
the variational integral of h if H is finitely additive over partial sets 
of E and if, given e > 0, there are a finitely superadditive function S of 
partial sets of E with S(E) < 8c, and a positive function ô on EF, such 
that for all 6-fine (I,x), 


(5.7) |h(1,x) - H(I)| < S(1). 
Clearly (5.7) implies (5.6), while (5.6) implies (5.7) by using 
S(E,) = sup (D') = |h-H| 


over all -fine divisions D' of Ey. For if Ej E, are disjoint, divisions 
D. over E} (j = 1,2) give the division D, U D, of E} UE,. But not every 
division of E] u E, can be split up into divisions D, of E} (G e i2) as 
the ô could sometimes be greater than a Sp: 

Theorem 5.3 also leads to the variation. Given a positive function ô 
on EF let 


V(h363E) = sup (D) Z |h(1,x)], 


for the supremum over all -fine divisions D of E. The variation of h in E 


ts 


V(hsE) = inf V(h363E), = lim sup (D) = |h(I,x)|. 
5>0 & > 0+ 


The supremum and infimum could be a conventional +œ. If the variation 
is finite, so that V(h;6;E) is finite for some positive function ô on Ec, we 
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ay that h ¢s of bounded variation (VB*) in E. If the variation is 0, we 
ay that h is of variation zero in E. Two brick-point functions h,k for 
ihe same ES are vartationally equivalent there if h-k is of variation zero 
inl, 

If X is a set in n-dimensions with indicator x(X;x) we write, 
respectively, V(h;6;E;X), V(h;E;X) for V(h.x(X3.)363E), V(h.x(X3.)E). 
li V(h;E;X) is finite, we say that h is of bounded variation in X, relative 
> 

Ihere should be no confusion between V(h;E;X) and V(h383;E) as the 
latter has a Greek letter. We say that a property is true h-almost everywhere 
(hou.e.) if it is true except in a set X with V(h;E;X) = 0. Such a set X 
is said to be of h-vartation zero. 

Much interaction between integrals and variations occurs, as the 
following shows. 


Ihoorem 5.4: (5.8) If the brieck-point function h for ES ts integrable to 
(1) over each partial set E, , then h-H ts of variation zero and h is 
sivfattonatly equivalent to H. Also 

(4,9) V(h;E;X) = V(H3E3X) 


jor each set X in n dimensions, even tf one side is to. 


(4,10) If V(h3E) = 0 and if E, ts a partial set of E, then 


V(h,E, ) = 


(8,11) ly dh 
1 


| 
oO 


and there exists 


LL 
oO 


(4,12) Conversely, tf (5.11) ts true for every partial set E, of E 


then V(h3sE) = 0. 


8,19) If |h| ts integrable to K over E then V(h3E) is finite and 


equal to K. 
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(5.14) Conversely, tf in (5.8) V(h;E) ts finite then |h| is integrable, and 


1A 


(5.15) |H(E,)I= in dh| f. djh| = V(h;E}), = V*(E,) (say). 
(5.16)  V(h;E;X) = V(H3E3X) = V(V*;E;X). 


Proof: (5.8) is just a rewriting of Theorem 5.3 (5.6) in two ways using the 
variation. For (5.9) we multiply by x(X;.) the inequalities 


(5.17) |h] < |h-H| + |H] , |H] < |H-h| + [h| 


and then sum over a division of E, showing that if one side of (5.9) is 
finite, so is the other, and then that the two are equal. For (5.10) and 
(5.11) we use a 6-fine division D, of Ej and when Ey # E, a -fine division 
D, of ENE, with 


(5.18) |(O,) zh| < (0,)z|h] < (D, u D5) xh]. 

Hence V(hE, ) = 0 and H(E,) = 0 follow from V(h3E) = 0. For (5.12), H(I) = 0 
for all Ic E, and (5.8) gives (5.12). For (5.13), given s > 0, there is a 
positive function § on Ec such that if D is a 6-fine division of E, 


K-e < (D)z|h| < Kte, K-e < V(h363E) < Kte, K-e < V(h3E) < K+e, 


and K = V(h;E), finite. In (5.14) we use (5.17) to show that |h| is 


integrable over E if and only if |H| is integrable over E, to the same value. 


By Theorem 5.1, H is finitely additive over partial sets of E, so that if 


D, is a division of a partial set Ey of E, 


1 
|H(E,)| = |(0,) HOD] < (0,) z [HCD] 
and |H] is finitely subadditive. Hence for each division D of E 


(D) = |H(I)|.s V(H;E), = V(h;E) 


IA 


using (5.9) with X = R”. Hence by Theorem 4.7, |H] is integrable, and so |h] 
) 


is integrable. (5.15) follows by (5.18) and the integrability. Then in 


(5.16), as in (5.9), 
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V(h;E;X) = V(|h|;E;X) = V(V*;E;X). 


A special case of (5.13), (5.14) is well worth setting out as a separate 
thoorem using the volume v(I) of the brick I. 


lhoorem 5.5: Let h(1,x) = f(x)v(I) be integrable over an elementary set E 
integral H(E,) over E; . Then |f(x)|v(I) is integrable over E if and 
ly if H is of bounded variation over E. 
Non-absolute integrals H are not of bounded variation, and for these 
wo need another definition. The brick-point function h on E° ts of 
voralised bounded variation (VBG*) in a set X relative to E, if X is the 
union of an at most countable number of sets Xj on each of which h is of 
bounded variation relative to E. 
If the sum of the V(h;E;X}) is finite, the next theorem shows that h 
la of bounded variation on X relative to E and we go no further. 


lheorem 5.6: If h is a brtek-point function on E° and tf (X5) ts a 


juenve of sets in R? with union X, then 


V(h3E3X) $ 


1M8 


WOME SAX aJi 
j (h3E X 5) 


hoot; IF some Xj have points in common, the left-hand side is unaltered, 
but the right-hand side will tend to increase. Thus we can assume mutually 
disjoint Kas and a finite right-hand side or else there is nothing to prove. 
Liven * 0, let the positive function S; on E? be such that 


19) v(ns8,3E5X;) < W(hsEsX5) + 2.29 (4 = 1,2,...). 


Wa take 


ô. (x) (x € Xj J = 1,2,...), 


1 (x EXX). 


if P is a S-fine division of E and if Qj is the subset of the (I,x) € D with 
E X, then as D is only a finite set, there is an integer m with Qj empty 
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for j >m. Hence by (5.19), 


m m 
(D) z |h(I,x)| x(Xsx) = £ (Q,)Z|h(I,x)| < £ V(h38.5E3X,) 
ja d jet gg 
< X V(hsE3X.) + €, V(h363E3X) < I V(h3E3X.) +e. 
j=1 a j=1 a 


Letting ô and € tend to 0, we have the theorem. 

In the language of Lebesgue theory, Theorem 5.6 shows that the variation 
is an outer measure. We use this to prove some results connected with 
integration by substitution. In particular, the integrals of f(x) k(I,x) 
and g(x) k(I,x) are equal if f = g k-almost everywhere. 


Theorem 5.7: If f is a fintte-valued point function on E° and k is a 


brick-potnt funetion such that 

(5.21) V(k;E;X) = 0, then 

(5.22) V(fk;E;X) = 0, | FoXlXzsJdk = 0. 
E 


Conversely, tf (5.22) is true and if Xo ts the set where f # 0, then (5.21) 
ts true with X replaced by X N Xg: 


(5.23) If f is a finite-valued point function on E°, if f(x)h(I,x) is 


integrable over E, and if 
(5.24) V(h - k3E) = 0, 


then fk is integrable over E and the two integrals are equal for all partial 
sets of E. 
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(8,25) 2f H(E,) = | gdh extsts for all parttal sets E, of E, the 
E, 
existence of either of | fdH, l fgdh implies the existence and 
E E 


equality of both. (Integration by substitution). 


(4.26) If for all partial sets Ey of E and two point functions f,g and 


a brick-point function k on ES, 


| fak = | gdk, 
Ey Ey 


thon F = g k-almost everywhere, (k need not be non-negative.) 


(0.27) If for all partial sets Ey of R, a point function f and two brick- 


point funetions h, k on EC š 


le fdh = | fdk 
1 E, 


n f = 0 (h-k)-almost everywhere. 


Proof: For (5.22) let X; be that part of X for which j-1 < |f] < j(j=1,2,...). 
Ihen V(fk; E;X;) < JV(KSESX5) < JV(k3E3;X) = 0. 


Iheorems 5.6 and 5.4 (5.11) give (5.22). For the converse we use the proof 
just given, replacing X by X N Xq5 and k and fk by fk and #7! (Fk) = ky 
respectively, which can be done in Xg: For (5.23) we use (5.24) just as 
(4,21) is used to get (5.22). Thus 


(4,28) l fd(h-k) = 0 

i 
for all partial sets E, of E. Theorem 4.4 (4.4) finishes the proof. For 
(4.25) we use Theorem 5.4 (5.8) and substitute H and gh for h and k in (5.23). 
For (5.26) we use Theorem 4.4 (4.4) to obtain 
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| gik = 0 
EY 


for all partial sets E} of E. Then Theorem 5.4 (5.8) gives 
V((f-g)k3E) = V((feg)k - | (f-g)dksE) = 0 


and the result follows from (5.22) implying (5.21) with X = E° and Xg the 
set where f-g # 0. (5.27) follows using Theorem 4.4 (4.4) for (5.28), and 
then a similar proof. 


Theorem 5.8: For disjoint elementary sets Ey Eo and 8p the positive 
funetion of Theorem 4.6 (4.7)let j be a positive function on ESG > Ages 


with ô as in Theorem 5.2. Then for an arbitrary positive function §* on 


Cc Cc 
E, U Ep”. 
(5.29) V(h56*5E,) +V(N3O*3E5) § V(h36*3E, U E,), 
(5.30) V(h383E,) + V(h;6;E,) = V(hS5E, U E2), 
(5.31) V(h3E,) + V(h3E5) = V(h3E, U E,). 


(5.32) If E} is a proper partial set of E with V(h3E) finite, £ > 0, 
and 6, a positive function on E° with V(h3535E) < V(h3E) + g€; 
then V(h3633E,) < V(h3E,) +e. 
Proof: If Sj is a sum of |h| for a 6*-fine division of Ej (j = 1,2) then 
S4 +S» is a sum for a §*-fine division of E, U Eos giving (5.29). Next, 
as 6 § õp» a sum of |h| for a -fine division of E} U E, gives a sum for 


E, plus a sum for E,, and the opposite inequality to (5.29) follows to 
give (5.30). Given e > 0, we choose F on E° with 


V(h36, 55) < V(h3E,) + he, 


Constructing & and using (5.30), 


V(hsE, UE.) <V(h363E, UE.) = V(h;6;E4) +V(h353E,) <V(h3E,) + V(h3E,) +ë 
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(4,33) V(h;E} U Ep) < V(h3E,) + V(n3E5). 


jn particular, if the right-hand side is finite, so is the left-hand side. : 
ne ; c 
lo prove the opposite inequality there is a positive function Sq on E, U Eo 


uch that 


V(h3dq3E, U E>) < V(h3E, U E2) +e. 


laking 5; = 64 on ESU = 1,2) we construct 6 and find from (5.30), 


v(nyE,) +V(h5E,) < V(n365E,) +V(n365E,) = V(h;8;E} U E>) < V(n3E, U Ep) + es 


2 


yiving the opposite inequality to (5.33) and so (5.31). Using (5.29), (5.31) 
in (5,32), 


HA 


V(h;633E4) V(h;833E) - V(h3633ENE,) < V(h3E) +€ -V(h3ENE,) =V(h3E,) + & 
theorem 5.8 (5.29) shows that S(E) = V(h36*;E) is finitely superadditive 

aver elementary sets E, so that there is a second definition of the 

variation of h, corresponding to the second definition of the variational 

integral, as the infimum of S(E) for all finitely superadditive functions S 

üyər partial sets of E and positive 6* that satisfy |h(1,x)| < S(I) for all 
‘fine (I,x). We use this definition in the next theorem. 

iheorem 5.9: h ¿s VBG* in X, relative to E, if and only if there are a 

j atthe fretten 6 tn p° , a non-negative finitely superadditive function S 
P bricks in E with S(E) > 0, and a point function k > 0, such that for 


Py -fine ERa 
(8,34) |h(I,x)| < k(x)S(T). 


roof: If h is VBG* in X relative to E, there are sets Xj (j = 1,2,...) with 
union X, in each of which h is of bounded variation relative to E. Clearly 
we can take the X, mutually disjoint. There are positive functions 8; on 

L° and finitely superadditive functions Sj such that for all 6;-fine (1.x), 
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When h 2 0 and both fh and |f|h are integrable over E, we prove 
later that F is h-AC* in EC and E, but the more difficult proof needs a 
‘limit under the integral sign' theorem, see Theorem 8.4. 

There is a property of regular measures (in the Lebesgue theory) that 


|h(I,x)| < S.(I) (x e€ Xj J = LES n 


We prove (5.34) by taking 


(x) =6.(x) (x €X.), 6(x) =1(x ¢ X), S(I)= 279s (1)/S.(E) k(x) = 255 (E) is not always true for general measures. Here we prove it true for the 
3 q d= J J variation. 
(x € Kj» j= 1,2,...), k(x) = 1 (x £ X). Theorem 5.11: If (x;) is a monotone increasing sequence of n-dimensional 
sets with union X, then lim V(h;E;X;) = V(h3E3X) for every brick-point 
Conversely, if (5.34) is true let Xij be the set where j - 1 < k(x) <j. , è ae , 
Then function h on E“ . More generally, for an arbitrary sequence (x,) of 


n-dimensional sets, 


|h(1,x)| < jS(1) 
(5.35) V(h;E; lim inf X.) < lim inf V(h;E;X;). 
joo jew j 
and h is of bounded variation on Xij for j = 1,2,..., and so is VBG* in X 
relative.to Es f é i Proof: As X >X, we have 
For two brick-point functions h,k in E” and a set X of R , we say that h J 
pe : ; WRI A č : 
ts k-absolutely continuous (k-AC*) in X and E, if for varying sets Xg cX, (5.36) V(h;E;X.) < V(h;E;X), lim V(h;E;X,) < V(h;E;X). 
j fies j 

V(h;E;X9) > 0 when V(k;E-Xg) > 0. 

Thus if the limit is +œ the result is true. If the limit is finite, we 


F z A c ; 
Further, h is generalized k-absolutely continuous (k-ACG*) in X and E, if prove the opposite inequality to (5.36). Let 6 (x) > 0 an E7 saetety 


there is a sequence (x) of sets with union X, such that h is k-absolutely 


continuous in each Xj and E. (5.19) V(n365 35X55) à V(h;E;Xx;) + e2 


Theorem 5.10: If f is a point function and h a brick-point function in EC, for j = 1,2,..., and define 
both real or complex valued, then fh is h-ACG* in E° and E, and tf fh zs 


integrable to F(E,) over partial sets Ey of E, then F ts h-ACG* in EC and E. 6(x) = 65 (x) (x Š XXi- J= Vatynees Xo empty), (x) = 1 (x exx). 


J 


If also h ts VB* or VBG* on X relative to E, except possibly where f = 0, 7 
If D is a 6-fine division of E and if Q, Q, are the subsets of D with the x 


in X and in XK respectively (j = 1,2,...), there is a greatest integer 
m, depending on D, such that Qn is not empty. Let Ej be the union of the 
bricks I from the (1,x) € QC = 1,...,m). Then from (5.19) and Theorem 

5.8 (5.31), (5.32), 


then fh , and F when fh ts integrable, are also VBG* on X relative to E. 


Proof: For the first part, for each set Xj where j - 1 < f(x) < j, fh is 
h-AC* in Xj and E(j = 1,2,...), and if fh is integrable to F, Theorem 5.4 
(5.9) shows that F is h-AC* relative to the same set. Next we use (5.34) 
in the proof of Theorem 5.9, with k multiplied by |f| , or by 1 if f = 0. 
If k does not exist at a point where f = 0, we multiply S by 1. 
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m m 
(D)E[h(I,x) [x(Xsx) = (Q)E|h(I,x)| = £ (Q.)Z{h(I,x)| < E V(h36.3E.3X_) 
j=1 J jel J Jog 
m 4 m 
< © {V(hsE.3X.)+e.2 9} < E V(hsE.3X_) tes V(NsE3X_) tes lim V(h3E3X.) +e, 
j=1 gaa j=1 Jep m joe j 


giving the opposite inequality to (5.36) and the result for a monotone 

increasing sequence. For an arbitrary sequence (X5), as fi X; is monotone 
j=m 

increasing in m, 


V(h;E;lim inf X,)=V(h;E; U N X,)=1im V(h;E; 
j>% J m=1 j=m mæ j 


8 


11> 


X5) < lim inf V(hsE5X5) 
m j>% 
as the last set intersection is contained in X, for every j 23 m, so that we 
can choose suitable j. Thus we prove (5.35). Note that if (X,) is monotone 
increasing to X, then (5.35) gives V(h;E;X) < lim V(h;E;X,), the opposite 
inequality to (5.36), so that there is equalize. i 

Some results are best seen in one dimension, though they have extensions 

to R”. We collect three here, beginning with integration by parts for 
Stieltjes-type generalized Riemann integrals. 


Theorem 5.12: Let F,G be point functions on the closed interval [a,b] with 
b-a finite. Let AF and dF stand for A(F;(u,v)) = F(v)-F(u) and dAF, respec- 


tively. If 


(5.37) FdG + | GdF = A(FG3;[u,v)) 


[u,v) 


ad 


for all [u,v) c [a,b), where both integrals exist, then 
(5.38) V(AFAG;[a,b)) = 0. 


Conversely, if the first integral exists in (5.37) for all [u,v) c [a,b), 
with (5.38), then the second integral exists and (5.37) is true. 


Proof: We use the identity 
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F(t) {G(t)-G(w) } =F (t)G(t)-F(w)G(w)-G(t) {F(t)-F(w) } + {F(t)-F(w) }{G(t)-G(w) } 
for t = u, w= v, and for t =v, w= u, with (5,37). Then 
V(AFAG;La,b)) = V(FAG +GAF-A(FG) ;[a,b)) awc] FdG +| GdF-a(FG);[a,b)) = 0. 


Conversely, if in (5.37) the first integral exists with (5.38) true, the 
identity gives 


vef FdG + GAF - A(FG);[a,b)) = 0. 


As A(FG) - | FdG is finitely additive, the first definition of variational 
integral just after Theorem 5.3 shows that GAF is integrable to A(FG) - | FdG 


on the partial intervals of [a,b) and (5.37) holds. 
To put this into a possibly more recognisable form we can substitute 
into (5.37), 


F(v) - F(u) = fdx, @(v) - Gu) = gdx, 


N T 


Fgdx + l Gfdx = A(FG;[u,v)), 
[u,v) 


(5.39) | 
Cu,v) 
in the sense that (5.39) holds if the integrals F, G and the integrals in 
(5.39) exist; and if the integrals F,G and the first integral in (5.39) 
exist, then the second integral there exists with (5.39). These results 
follow by using Theorem 5.7 (5.25), the integration by substitution, and 

by noting that (5.38) is automatically true. For by choice of the positive 
function 6 and given e > 0, we can arrange that 


|f(x)g(x)|S(x) < €, V(AFOG;63[a,b)) = v(#(x)g(x) (v-u)*;83La,b)) 


< V(e(v-u)363[a,b)) = e(b-a). 
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As true for all e > 0, (5.38) is true, and in the calculus form of the 
integration by parts we do not need (5.38). 


The formula (5.37) can be rearranged to a form that can be generalized 
to n dimensions 


(5.40) {F(x)-F dG = = 
lies x)-F(u)} fuy oY Gy) }dF, 


where y is the end of the interval I opposite to x. This follows from the 
identity without using (5.38). 


In n dimensions let F be a point function on Jo = [a,b] where 
a = (ags esea) b = (bjsss saD) a. <b, (j = 1,...,n), and for 
= z : Ç 4 ; 
u (Uys+--5U,), V (ViseeeoV) in J“ with uj < vj (j= 1,...,n) let us write 


ALF = A, (F3x,u,v) = a 
k kí u v) F(x, Xp aVE Xp spores oXy) 


peer eX pg Up oXpygaeee aX) 


AF = Al Fru) = OB, 45 eA F 


This last is the n-fold difference of F over [u,v). 
The repeated integral obtained by integrating over [a] b4) then over 
[ansbo)seees then over [a,.b.), is written using the symbolism | teal [a,b) “= 


This is usually equal to the integral over J, see Chapter 6. 


Theorem 5.13: If the two repeated generalized Riemann integrals in (5.41) 
exist then 


(5.41) | a k y A(F;a,x)dG = | sa i . A(G3y,b)dF, 


where F and G are point funettons on J. The second integral is equal to 


(5.42) | cand A(G3x,b)dF if 


leg 


(5.43)  V(|A(Gsy,b) - A(G3x,b) |A(F3x,y)3fa,b)) = 0. 


Proof: Use (5.40) repeatedly. 
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Theorem 5.12 shows that the converse is easy when n = 1, but it is not 
obvious when n > 1, since in (5.43) the difference of two AG is a function 
of b. 

Next we look at the Cauchy and Harnack extensions of the generalized 
Riemann integral. The Cauchy extension was first used on the Riemann 
integral to integrate some functions unbounded in the neighbourhood of a 
finite number of points, see Theorems 2.12 to 2.16. In turn, the Lebesgue 
integral was similarly extended, to integrate some functions that are not 
summable (Lebesgue integrable) in the neighbourhood of certain points. 
Another extension, first given by Harnack, extends the integral to an open 
set if the integral exists, possibly by Cauchy extensions, over each component 
interval of the open set and if another condition is satisfied. These 
conditions were first given in one dimension. Denjoy used both extensions 
repeatedly in a transfinite inductive process in order to integrate all 
derivatives, and beginning with the Lebesgue integral over certain sets. 
The theory is tremendously complicated, and only a dedicated student could 
hope to understand all its details. However, Perron defined a much simpler 
integral, and in the 1920's it was shown that the Perron and (special) Denjoy 
integrals are equivalent, they integrate exactly the same functions, to the 
same values. Now it is even simpler. Our generalized Riemann integral 
integrates all derivatives (see (3.16)), so that it is not surprising that 
the Cauchy and Harnack extensions are already included in the generalized 
Riemann integral. 

We first note a uniform continuity property of h(I,x) - H(I) if h 
integrates to H over all partial sets of the elementary set E. For, taking 
a single (1,x) out of D in Theorem 5.3 (5.6), we see that 


(5.44) given € > 0, there is a positive function S on EC such that if 


(1,x) is 5-fine, |h(1,x) - H(I)| < €. 
In one dimension, taking x = b (see Ex. 4.1) and I = [v,b) ¢ [u,b), 
h([v,b),b) + H(u,v) - H(u,b) > 0, h([v,b),b) + H(u,v) > H(usb). 


Thus we are led to the next theorem. 


67 


Theorem 5.14: (Cauchy extension) For each u,v ina su <v <b suppose Similarly we can deal with intervals whose left-hand end-points 


that h(1,x) ts integrable over [u,v) to H(u,v). If there exists | tend to a+. 
If we now assume a conventional value h([v,+~), + œ) = 0, the proof 


H, (u,b) = lim [H(u,v) + h([v,b),b)] of Theorem 5.14 suggests a definition of the integral over [a, + œ). We 
v= can use divisions of [a,+«) that consist of ([N,+0), +o) for some large N, 
then H(a,b) exists and ts equal to H; (a,b). together with a division of [a,N), and we can let N > + o while § shrinks, 
for some positive function 6 on [a,te). Taking h(I,+0) = 0, we have a sum 
Proof: For a strictly increasing sequence (v.) tending to b- as j + œ , with for such a division, and the definition of the integral follows as usual. 
Vo = as then H(vj qY) exists for j = 1,2,... Given e > 0, there are Theorem 5.14 with b replaced by + œ, shows that the new definition is 
positive functions 6; on [vj 49V4] such that if 0, is a 6;-fine division of equivalent to letting b + + œ in the integral over [a,b). The importance 
[vj a4)» of the new definition is that the proofs of results such as the various 
limits under the integral sign go through for [a,+%) just as they do for 
(5.45) |(0,) Dh(I,x) - H(v;_yovs)| PP 4 (i = edie [a,b), we do not have to use a further limit on the result. Similarly 
Theorem 5.11 (integration by parts) goes through. 
By definition of H, (u,b) and the finite additivity of H, if |v-b| <6p(b), Similarly divisions of (-«,b) consist of ((-osM),-«) together with a 
for some 6 (b) > 0, division of [M,b), with h(I,-«) = 0, and here -M is large and tends to +o, 
Also divisions of (-c, +o) consist of ((-0,M), -œ) and (LN, +o), too) and 
(5.46) |h(Cv,b),b) - H, (v.b) | <e. divisions of [M,N), with h(I,x) = 0 when x is replaced by -œ and + œ, and 
-M and N tend independently to +. The tests given to Theorem 2.12, 2.13, 
We now take 6(b) < Sg (b)s la) < 64 (a), 5(a) s v78, s(x) smin(6j(x),x-V;_q>Vj=x) 2.15-2.19 now have a wider application, to ensure that various generalized 
Riemann integrals exist. 
(j-i <x < vj) alv) £ min S5(V5) 985 Vj) (Fs TB cea!) The Harnack extension is as follows. For F cI = (u,v), u € F, where 


F is compact, the open set G = INF is the union of a sequence (y) of disjoint 


If D is a -fine division of [a,b) we can then split up D into divisions 0. open intervals. If a function f is integrable by some reasonable means over 
of [vj Yj) (J = 1,2,...39), for some integer J, a division D),, of [v sv) F and the separate I,, and if 
for some v in Vs sy S Vay and ([v,b),b). By (5.45), (5.46), and (5.5) © 
in the proof of Theorem 5.1, if Dy = DX(Lv,b),b), (5.47) z Mg 
k=1 
= J+1 J -i 
|(Dg)E(h-H)+h(Cv,b),b)-H4(v,b) | a St (0; )2(h-H) te <Ee.27 + at” a eo = BA is convergent, where 
j=l j=! 
f 
By the finite additivity of H, Mk = max] | fdx | 
J J 
|(D) Zh - H(a,v) - H, (vb) | < 2e, H(a,v) + H, (vsb) 2 H, (a,b), for all intervals J c Iņ» then the Harnack-extended integral over I, is the 


sum of the integrals over F and the separate I. 


and H(a,b) exists equal to H, (a,b). 
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To show what this implies, we note that the convergence of (5.47) | 
means that, given e > 0, there is a K with 


(5.48) y My Seg 
k=K 


i.e. if J for a finite number of distinct k 2 K, then 


cl 


k k 


(5.49) z | fdx| <e. 

Jk 
Conversely, if (5.49) is true and if we take first the positive integrals 
and then the negative integrals, we have 


zf fdx| < 2e, 
Jk 
for any finite number of distinct k 2 K, and so for all k 2 K. Hence 
(5.48) with 2e for e, and so (5.47) is convergent. Thus the following 
theorem shows that the generalized Riemann integral includes the Harnack 


extension process. 


Theorem 5.15: For F c [a,b] with a,b € F, compact, let G = (a,b)F, an 
open set and so the unton of disjoint intervals (u;.v,) wtth 
[ujv] cG (j =1,2,...). Given h(I,x), let 


(5.50) x(F;.)dh, | dh 


fess Cu; sv.) 


exist. If, given e > 0, there are an integer J and a positive function § 
on [a,b], such that for every finite collection Q of disjoint intervals 
[u,v) c [a,b), with [u,v) = Cu; .v5) or Cu; v5) n I, for some j 2 J, some 
-fine (1,x), and some x € F, and no two intervals [u,v) lying in the same 


[ujv we have 


(5.51) | (Q)z f dh| < €, 
u 


3 


then there extsts 
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dh = X(F3.)dh + È dh. 


(5.52) Ka j=1 ee 


lies 


Proof: By Theorem 4.4 (4.4) we can subtract the first integral in (5.50) from 
both sides of (5.52), and so we need only prove that 


dh 


1m g 


(5.53) i : x(G3.)dh = H ([a,b)) where H, (E) = 
a, 


jel (u,v; )ne 


for each elementary set Ec [a,b). As E is a finite union of intervals each 
of which has two frontier-points, [u.,v.) n E is either empty or is [u5sV5)s 
for all but a finite number of j. Hence by (5.51) the sequence of partial 
sums of the infinite series for H, (E) is fundamental and so convergent, and 
Ho exists. 

For equality in (5.53) we note that there are 65 (x) > 0 on Cu; v] such 


that 
(5.54) (D,) z |h(1,x) = Hp(T)] < 29, 


-fine divisi _ of [u.,v.). < min(é, “=X, XU, 
for all 6; fine divisions 0; o [uj vj) Now let 6(x) min(e, (x), v; X, X u) 


in uj <x < vje Let S(u,) s 6; (uj) and 6(u,) < Sp (Vy) if My = Uj for some 


KG Eiaa AS [ujv] cG, if x € F then x is not in Cu.,v.]. Thus, 
for J as in (5.51), we keep S(x) > 0 so small that (x-d(x), x +6 (x)) does 
not intersect Cu; »vj] with j <J. If D is a -fine division of [a,b) let 
(I,x)€D. If x € F then x(G@3x)h(I,x) = 0 and I can only overlap with [u;.v,) 
if j 2 J. Thus the sum of the H(I) for such I, will be an infinite series, 
the limit of a sequence of sums over certain Q satisfying (5.51), and the 
modulus will not be greater than e. If, on the other hand, (I,x) €D, x EG, 
then for one or possibly two integers j, x € [uj Vj] and (I,x) is 6;-fine. 
By the construction of 6 it follows that for all j < J the Cu;.v,) are 
divided by (1,x) € D with x € G, and a finite number of tu; >v,) with j 2d 
are divided wholly or partially, and using (5.54), 

J-1 A œ 


|(D) 2x(Gsx)h(1,x) - H,([a,b))| < Z e2] 42 2 e2 + e<3e, 
j=1 j= 


which proves the theorem. 
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Combining Theorems 3.4, 5.13, 5.14, we see that at each stage of 
construction of the special Denjoy integral from the Lebesgue integral, 
the construction is a generalized Riemann integral. Hence the special 
Denjoy integral is included in the generalized Riemann integral. The 
latter is wider as it can integrate some interval functions as well. 


Ex. 5.1. Let hy shy be brick-point functions on EC and a, B be constants. 
Then 


V(ah, + BhosE) $ |olV(hysE) + [B|V(h2;E). 
Ex. 5.2. If in Ex, 5.1 hy shy are real-valued and h = h; + ih, then 
V(h;;E) < V(h3E) (J = 1,2). 


Ex. 5.3. If on pe hy is variationally equivalent to hy and uP is variation- 
ally equivalent to has then hy is variationally equivalent to ha. 


Ex. 5.4. If Xo Xi Ec Ejo prove that if h is a brick-point function on EC, 
V(h3E3X) < V(h;E4 5X4). 


Ex. 5.5. If E,»E, are partial sets of E with a and E disjoint, and if h 
is a brick-point function on gf, then 


. . c . . c = . . c c 
V(h3E3E, ) + V(h3E3E, )= V(h5E3E, U ES J> 


Hint: There are disjoint open sets G,, G, with 6; > E} (j = 1,2). Now use 
Ex. 4.2. 


be a proper partial set of E and Ez = ENE). If Xn ES and 


Ex. 5.6. Let E 
Cc 
J 


YN Ey are empty, then for a brick-point function h on E 


V(h3E3X) + V(h3E3Y¥) = V(h3E3X u Y). 
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Ee. 5.7. Let (1,°) be a finite or infinite sequence of interiors of 
disjoint bricks with union J. Then 


V(h3E3J) = 


um g 


V h;E;1.° 2 
( j ) 


Hint: The left side is not greater than the right, by Theorem 5.6. Now 
take a finite sum and see Ex. 5.6. 


1 


Exs. 5.8. In R leta <b, [a,b) cE, and lim V(h;E;[a-e, ate]) = 0. Then 


c>0 + 
V(h;E;[a,b)) = V(h;E;(a,b)). 
On the other hand, if lim V(h;E;[b-€, b+e]) = 0 then 
€>0+ 


V(h3E3[a,b)) = V(h3E3;Ca,b]). 


If both limits are 0 then V(h;E;X) = V(h;[a,b)) when X = (a,b),(a,bJ,[a,b), 
and [a,b]. 


Ex. 5.9. If h is a brick-point function on E° and if |h| is finitely 
subadditive (in particular, if h is finitely additive) then |h(I)| < V(h;1), 
which may be +”. If |h| is finitely superadditive then h is of bounded 
variation and |h(I)| 2 V(h31). If |h| is finitely additive, then h is of 
bounded variation and |h(I)| = V(h3I). In particular, if V*(J) = V(h3d) 

(J <I), prove that V(h3I) = V(V*31). 


Ee, 5.10. Let the brick-point function h be of bounded variation in E and, 
given € > 0, let the positive function 6 on ES be such that 


V(h363E) < V(h3E) + £. 


Then there is a 6-fine division D of E such that every subdivision D, cD 


satisfies 


(D,) = |h| > Vhs U I) - 2e, 
> 
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Hence for fixed x € EF and 6-fine (I,x), 


lim sup |h(I,x)| = Tim sup V(h3I). 
S(x) +0+ S(x) + 0+ 


Hint: Choose D so that 

(D) Z|h| > V(h363E) - e 2 V(h3E) - e. 
Then by Theorem 5.8 (5.31), (5.32), 

(D,) = |h| = (D) E|h| - (D~D,) = |h] 


> V(h;E) - e -V(h;EĒx UI) - © = V(hs u I) - 2e. 
v vi 
Now take an x € ES, By the construction of Ex. 4.1 and since V(h;E) is the 
infimum of the V(h;6;E) we can assume that S(y) < |y-x| (y € E°, y # x), so 
that D has to contain at least one (I,y) with y = x. Let (6,) be a sequence 
of such 6 with 5 (x) > 0+ and with € =e > 0+. If (1, 2x) is §,-fine and in 


a suitable Dv 
[n(I,4x)| > V(hsI,) = 2e,, 


lim [h(I 4x) | 2 lim V(hsI,). 
neo now 


The reverse inequality is easy and the proof can be completed using these 
notes. 


Ex. 5.11. (Cantor's ternary set) On the real line Tet h([u,v),x) = v-u 

and let [a,b] c [A,B]. Prove that vV(h;[A,B);[a,b]) = b-a and give a similar 
result for a finite number of disjoint closed intervals. From Cy = [0,1] we 
remove the middle third open interval Ge 5 to get C,. We then remove 

the middle third open interval of each closed interval in C, to get Cos 

and we repeat this process to have a sequence (C.) of sets each consisting 
of a finite number of disjoint closed intervals. Prove that V(h;[0,1);C;) 

= (2/3)]. The intersection C of all the c} is called Cantor's ternary set. 
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Find V(h;[0,1);C) and | x(C;.)f(.)dh where x(C;.) is the indicator 
[0,1 


of C and f is an arbitrary function of points. 
(New University of Ulster 1983, M313) 


6 The Integrability Of Functions Of Brick-point Functions 


To deal with this subject in Lebesgue theory, in which the brick-point 
functions are of the type f.(x)u(I), a great use is made of the concept of 
measurability of the point functions f.(x). Later we will show that an 
integrable function F0du() is such that f.(x) is y-measurable, but here 
we use a different method, which also deals with functions of the more 
general h.(I,x). We could use a continuous functional of hj for the j in 
some set J, but for simplicity we use a function P(X pa 200 9X) : R” > R and 
consider the integrability of r(hys---oh), given that the Nyseeeoh, are all 
integrable over the same elementary set. 

It seems that r has to be continuous in some sense; here we use two 
senses. First, 


(6.1) [Yge Yg) T Oee $ Ag l¥y-Xy] # eee + ALY! 


m 


for constants Aj > 0, e.g. [ər/əx;| < A; (j = 1,...,m). More generally, 
for each e > 0 and each XyoeeesXns WE put 
S(Xy see 9X _3€) = sup [rly,o++esY ) rixe eX CEY Se Jelyn Ma 


m 


Then the continuity condition is that for an arbitrarily large positive 
integer p, varying Eg? 0 (q = 1,...,p), and fixed 


p 
(6.2) E Xe (Bi = asais 


p 
(6.3) X 


Clearly (6.1) implies (6.3), but (6.3) is more general than (6.1). 
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We also assume that 


(6.4) P(X) + VyseeeoXnt¥n) $ (Xp 2000oXq) + (Vy 9000 Ve) 


for all (Xp oe00 9X) (Yjet eyn) in R, 


It may be that there are better conditions on r to give what we require. 


Theorem 6.1: Let Henk. be of variation zero in the elementary set E for 


j = 1,...,m, where the hak, are brick-point funettons defined in EC, Then 


(6.5) r(hjseeeshp) - r(kys++esk,) 
has variation zero if either r satisfies (6.1) or the hj are integrable in 
E and r satisfies (6.3). 


Proof: Given s > 0, there are positive functions j on EF such that 
6.6 K.36.3E j = 1,...,m). 
(6.6) Von, 53; d<ee (j=l m) 


The § that is the least of the 6. at each point, is still positive and can 
replace the separate j in (6.6). Then (6.1) gives, for each 6-fine 
division D and using (6.6), 


D) Z |r(hy see sh ) = r(k k )| < (A 


i 12e eok) | $ jreeetA de 


and (6.5) has variation zero. 
If hj is integrable to Hj for each j let the brick-point pairs of D 


> a TS d ov, S Ae 5 Ya. = Hie z i 
be (I, Xq) a= p) and put ar jig Ysg hiag Then (6.2) 


is fixed at H; (E) and we can take 
m p m 
7 Huse J ES --H.303E) < me, 
€ z Von, jô g z € z Vhs H; 63E) < me 
using Theorem 5.8 (5.29) and (6.6). From (6.3), (6.5) has variation zero 
4 by H, (j = 1,...,m). ik. i. iati 
when kj is replaced by i Get m). As hj 3 and h; m have variation 


zero, so has krHs and H is also the integral of kje Hence (6.5) has 
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variation zero when h, is replaced by Hj (j = 1,...5m). Putting the two 
results together, we finish the proof. 


Theorem 6.2: In Theorem 6.1 with the hy integrable, let r satisfy (6. ats 
Then r(hyseeeoh,) is integrable tf, pr, only tf, for a positive 6 on E° and 
all ö-fine D over E, 


(6.7) (D) Er(hys+++sh,) 


ts bounded above. 


Proof: By (6.4) the brick function r(Hy(1)5..+ 9H, (1)) (I cE) is finitely 
subadditive. 


Theorem 4.7 then gives the results when Hj replaces hj» and Theorem 6.1 
finishes the proof. 


Theorem 6.3: Zet h.,k, be brick-point functions with h,-k, of variation 
zero in E°(j = Wins rath Then max (hse ehg) - max (kysees oK n has variation 
zero. If also each h. is integrable to H; then either and so both of 
max(hy>+- vhs max(H, »+++sH_) are integrable if and only if there are a 
bounded above set S and a positive function § on ES such that 


(6.8) (0) Eh sq yy (E>) eS 


for all §-fine divisions D of E and all choices j(1,x) from (1,2y.005m)s 
Further, (6.8) holds if h is integrable over E and etther 


1,...5m) for all 6-fine (1,x) in E°, on 


IA 


(6.9) h (1.x) < h(1,x) (j 


h(1,x) (j = 1,...,m) for all 6-fine (1,x) in EÔ. 


IV 


(6.10) h; (1,x) 


Conversely, if max (hy s.. shp) is proved integrable, it is a suitable h, 
and oren ehg is also integrable. When h; (1,x) = FO)k (15x) with 
k(I,x) 2 0, j(1,x) can be independent of I. 
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P : . = y, 5 ."y. = a 
roof: x, (x, Y5)+Y; s [Xs-¥5] + yj s [Xy] tee et XE Mal * max (yj s.e eyn) 
MAX (X49 +++2X-) - MAX(Y4 see Yg) s [pay] teet [Xan] 


Interchanging the x's and y's gives (6.1) with A; = 1. For (6.4), take the 
maximum of 


Xi tyy 8 MAX (X45 ee+9X_) + MAX(Yyo+++ Yn) 


From Theorems 6.1, 6.2 we can now read off the first results. (6.9) implies 
(6.8) since 


)sh, (D)Eh < H(E) +e 


for all 6-fine divisions D of E and suitable s(x) > 0, where H is the 
integral of h. 


If (6. 10) holds, so does (6.9) since 


m 
h. 2h, h.-h <s z (h_-h), h, < - - š 
Ai ) e hg (m-1)h 


If (6.9) holds, possibly with h = max(hys-+.sh,)s then 
hg-(m-1)h, min(hy 5... i. qee eha) 


Thus the max and min are integrable together. Finally, in (6.8) the best 
choice of j(I,x) is given by 


hicr) (I) = max(h,(I,x),.+.9h,(15x)), 


TF h;(1,x) = fj ()K(T,x) with k(I,x) 


IV 


0, 
max (hy (13x)... 3h (1x)) = k(I,x)max(f}(x)s... sf (x)) 


m 


and the choice of j(I,x) can be made independent of I. 
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Theorem 6.4: If t is fixed in 0< t< 1 and if in EF, f(x) 2 0, g(x) z 0, 
h(1,x) = 0 with fh and gh integrable over E, then pig! th is integrable over E. 


Proof: Here, r(x,y) = gy and we use HUlder's inequality. See Section 12, 
“Vy g =1,u20,v20,s20,w20, 


for example. Then for p> 1, q> 1, p 


uv +sw $ (uPasP)'/P(yIaw9) 1/9, say +Z) i * < (x44241) tz) t 


(x,20, z,20, j=1,2), 
coc ae ol oA bei & oe oe ae 
on putting t=p , 1-t=q , xX, =U, Zz, = ST Xo = V's 25 = w', and -r 
satisfies (6.4). Thus on refinement of partitions sums fall and they are 
bounded below by 0. 

The mean value theorem shows that (6.1) is not satisfied. But (6.3) is 


satisfied, showing the need of something beyond (6.1). We begin with 


(6.11) f(x) = ERF (x+z)? 


proved by 
f0) = 0, f'(x) = tlt - ioan t 20. 


Thus we prove that 


t 1-t t i, 1-t 1-t 
(6.12) (x,+2,) (xXp4Z9) < Oxy #124 | (xy +z] 7) 
= el + K(x, 2 0, Yj = as 2 0, JA 152)’ 
to Tt Ç 1-t t l-t 
K= |Z, | Xo + Xy |Z.| + |Z, | |Z5| z 


Next we prove that 


t l-t t 1st = Ha 
(6.13) 4y A (x424) (X323 + K(x; 20, yj = xj+Z5 2 0, j= 1,2), 


clearly true if zj 2 0 (j = 1,2). If z} < 0, (6.11) gives 


t 


t t t t 
1s (x,-[241) + jz (x,#Z4) + |z 


(6.14) x 
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-È to give (6.13). 


Similarly when z} 20, z, < 0. If z, < 0, z% <0, (6.14) gives (6.13) since 


When z} < 0, z, 2 0, (6.14) need only be multiplied by A 


t 1-t 
X4 Xp 


IA 


(qez) + AE {(xy#2,)'7* + jeg 


t 1-t t 1-t 
+ (x4+z4) |Z5| + |z] (xp4Zy) 


LL] 


t 
(x, +24) (Xp4Z5 
t 1-t 
+ |2,| 29 


(x4424) Catz) +K. 


IA 


Then (6.12), (6.13) give 


(6.15) [r(x X2) = r(y4 Y9) |£ K =r(|z;l>x2) +r(x>|z21) +r(|z; l> |221); 


zj =yj=Xj (j = 1,2). 


From (6.15), using Hdlder's inequality, (6.3) is true since 


p p 
E a ty í E te + r(X492€,) +rleqseq)) 
( p p ( p p ) p 
<r E és yx E-F Xe ne EE a ag > 
g=1 T gt 24 get 9 git gi 4 


which tends to 0 with the last sum when (6.2) is fixed. 

Previously, when m = 1 I have considered the requirements that r be 
convex and satisfy (6.1) when integrating r(f(x))h(I,x). It turns out that 
r is then rather restricted in scope unless f has a bounded range of values. 
For if r has a second derivative everywhere, convexity implies that that 
derivative is non-negative, and so the first derivative is monotone 
increasing, and is bounded because of (6.1). Thus even r(x) = xê is not 
included, except for a bounded range. To increase the generality we divide 
R up into a sequence (x) of sets such that A, = 29 in Xj» and still obtain 
what we need. A similar generalization can be used when r = r(x eX) 


Theorem 6.5: Let the convex funetion r:R > R, let R be the union of disjoint 


sets Xj (j = 1,2,...) such that for some positive function St on Ra 
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(6.16)  |r(y)-r(x)| < 25 yx | (x E X; |¥-x| ee Gly T Aa 


let h 2 0 be integrable to H, and let f be a point function such that fh ts 
integrable to H,. Then r(f)h is integrable if and only if the set of sums 


(6.17) (D) zr(f)h 


ts bounded above, for some positive function §* on E° and all 6*-fine 


divisions D of E. 


Proof: Let Wj be the set of x € EĈ such that f(x) € Xs and let 8; bea 
positive function of EC such that 


(6.18) (D) z |fH - Hy | < e4 


for all 6.-fine divisions D of E. By the differentiation of integrals 
(see Section 15) and disregarding those I with H(I) = 0, and a set of 
H-variation zero, 


f(x) - Hy/H] <8" (F(x)) 
for all 6°’-fine (1,x) and some positive function s**t on Eĉ. Now take the 
positive function 6(x) < min(é*(x), 6;(*), 6°" (x)) when x € Wes J = 1525-5. 
If D is a 6-fine division of E° let Dj be the subset of the (I,x) € D with 
xE Wj. Then 


(D)z|r(f)H-r(h4/H)H| = (P)z|r(f)-r(H}/H) |H < z 25(0,)n|f-H,/H|H 
j=1 


25 (0; )z|fH-H | < reg -e 


TN 
im g 


$ 


using (6.16), (6.18). As H can replace h in (6.17) and the integrability 
argument, see Theorem 5.7 (5.23), then by Theorem 4.7 it is enough to show 
that r(H,/H)H is finitely sub-additive. Let P be a partition of a brick I, 
formed of bricks J. Assuming H(I) # 0, 


H(J)/H(I) 2 0, (P) SH(J)/H(1) = H(1)/H(1) = 1. 
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Thus by the convexity of r, 


r(H,(1)/H(1)) = r(P) ZH,(d)/H(I)) = r(P) Z (H,(J)/H(d)) (H(J)/H(1))) 


< (P) Zr(H,(J)/H(J) HCD )/H (1) 
r(H,(1)/H(I))H(1) < (P) Z r(H4 (J)/H(J))H(J), 
proving the finite subadditivity of r(H,/H)H. 
Prove Theorem 5.4 (5.14) using |h| = max(h,-h) with real-valued h. 


Ex. 6.1. 


Let #(x)h(I,x) be integrable over E, let |f(x)| < g(x) and let 
Prove that |f(x)h(I,x)| is integrable 


Ex. 6.2. 
g(x) |h(I,x)| be integrable over E. 
over E. 


Ex. 6.3. For f(x), h(1,x) defined on Eĉ and a a real constant, let h, |h|, fh, 


|fh| all be integrable over E. Prove that max(fh,ah) and min(fh,oh) are 


integrable. 
Hint: (|f| + |a|)|h| is integrable. 
i ee 
Ex. 6.4. For real x,y let r(x,y) = (x°ty°)* = |x+iy|. Then 
Ixtiy| - |reis|| < [(xtiy) - (rtis)| < [x-r] + ly-s], 


WA 


[(x+r) + i(yts)| < [xtiy| + |r+is]. 
Hence this r satisfies (6.1) and (6.4). 


Be. 6,5. CEt ay 
(aj 2a) one vertex and (b bo) the opposite vertex. Let F be a real-valued 


< b} ay < Do» and let I be the two-dimensional brick with 


or complex-valued point function on R and define 
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G, (F51) = |F(x3b,) - F(x,a5) |dx, 


lai 2 


G, (F;I) = 
É er 


|F(b,.y) ii F(a; sy) |dys 

assuming that these generalized Riemann integrals exist for some brick J 
and all IcJ. 
a brick J, prove that 


If G)»G, are themselves generalized Riemann integrable over 


G(F;1) = {G,(F51)* + G,(F;1)? + (by-a4)"(by-ay)1* 


is also generalized Riemann integrable over J. 


Hint: This needs the three dimensional analogue of Ex. 6.4, e.g. by applying 
it twice. Also G, is finitely subadditive in [aysbo), so that integrability 
of G, implies that certain sums are bounded. Similarly for Gos so that 
certain sums for G are bounded. 


7 The Variation Set 


Given an elementary set E and a positive function 6 on ES, let Q be a 
finite collection of 6-fine brick-point pairs (I,x) with mutually disjoint 
Let h(I,x) be defined in E°. 
division of E and 


I lying in E. Then Q is called a -fine partial 


(Q) Zh(1,x) 
a 6-fine partial sum in E. The collection of values of all such sums with 
fixed E, including the empty Q with conventional sum 0, is denoted by 
VS(h36;E), the variation set of h on E, restricted by 6. If X = R" and 
x(X;x) is the indicator of X, the variation set VS(hX(X;.)363E) of h on X 
and E, restricted by 6, is denoted by VS(h363E;X).» The intersection VS°(h;E) 
of closures VS(h363E)° for all positive functions § on ES, contains 0, and is 
called the limiting variation set of h on E, and VS°(h;E3X) stands for 
vs°(hx(X;.)3E), the Limiting variation set of h on X and E. As a division 


of a partial set E, of E, is a partial division of E, a definition from Sec- 


1 
tion 4 gives 
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(7.1) S(h383E,) c VS(h;3;E) (E; cE). 


The definitions can hold in value spaces that do not have a modulus 
or norm with relevant properties, and then the variation set can partially 
substitute for the variation. In any case it is useful to know how far we 
can go with variation set theory. 

When the value space has a modulus or norm \|h || » then 


p 
h_ ||» 
ha II $l 


with VS(h363E) in the closed sphere, centre the origin and radius V(h363E), 
vs°(h3E) lying in the closed sphere, centre the origin and radius V(h;E). 
However, we can be twice as accurate by taking more care. We begin with 


a lemma, 


Lemma 7.1: Let real numbers XyoeeraXy have sum S, and V the sum of the 
moduli. Then any subset of the numbers has sum T Lying between 3(S + V), 


the bounds being attained, 


Proof: The sum of the other numbers is U = S-T and 2T-S = T-U, |T-U| < V, 
even for T = 0 or S. If all positive Xj and no others are in T then 
T-U = V. Ifall negative xj and no others are in T then T-U = -V. 


Theorem 7.2: For H a real-valued finitely additive function of partial sets 
Ey of E, the set of all such H(E,) ts bounded if and only if H is of bounded 


vartation on E, and then 


(7.2) U(E) = sup H(E,) = 3(H(E)+V(H3E)) 20, L(E) = inf H(E,) =3(H(E)-V(H3E)) < 0, 


over all partial sets z of E. Hence as E varies, U(E) and L(E) are finitely 


additive with 
(7.3) H(E) = U(E) + L(E), V(H;E) = U(E) - L(E). 


(7.4) If H is not of bounded variation sup H(E,) = +œ, inf H(E,) = =o, 


84 


(7.5) If £ > 0 and a proper partial set Ej of E satisfy H(E,) > U(E) - e, 


so that U(E) and V(H3;E) are finite, then L(E,) Be es U(EXE, ) < gs 


Result (7.3) is the Jordan decomposition of a finitely additive function 
of bounded variation of bricks, into its positive variation U(E) and its 
negative variation L(E), just as in one dimension a function of bounded 
variation is the difference of two monotone increasing functions. 


Proof: Let a partition P be a partition P} over a partial set Ejo together 
with a partition Py over ENE, if Ey is a proper partial set of E. In Lemma 
7.1 let the Xj be the H(I) for I € P. Then as H is finitely additive, 

S = H(E) and, for P fixed, 


V= (P) 5 |H(I)|, so that 3{H(E) + (P) z |H(TI) |} 


are the attained bounds of H(E,). Now |H| is finitely subadditive, so that 
if V(H;E) is finite, by refinement of partitions 


(P) z |H(T) | 


tends upwards to it, giving the values of U(E) and L(E) in (7.2), these being 
finitely additive in E since H and V(H;E) are, see Theorem 5.8 (5.31). 

If V(H3E) = +œ, (P) z |H| and so + H(E,) can be arbitrarily large, 
giving (7.4). 

Finally, from H(E,) > U(E)-e and (7.3), (7.2), we have (7.5) from 


L(E,) > U(E)-U(E,)-e = U(ENE,)-e 2 ~-e, U(ENE,)-e < L(E,) 0. 


WA 


If the value space has a modulus or norm and the variation defined using 
In|} 


JHCE,)-35|| < (P) z [BHI] s V(ahsesE), []8S|| < V(2h56;E). 


Returning to real or complex values, by the convention 0 € vs°(hsE), 
and 0 is the only possible limit of VS(h3;6;E) as 6 shrinks in E“. If 0 is 
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the only point contained in vs°(h3E) and if the closure of VS(h363E) is 
compact for some positive function ô on E, we say that h is of variation 
zero on E. To show that this definition agrees with the previous V(h;E) = 0 
we use the next theorem. 


Theorem 7,3: For a positive function § on E° and a compact set C let 


(7.6)  VS(h383E) cC. 


Then, given a sphere G centre the origin of the value space of real or 
‘ i Piaf ; c 
complex numbers, and of radius e > 0, there is a positive function §* on E 


such that 

(7.7)  VS(h;6*3E) c VS°(h3E) + G. 

(7.8) If (7.6) is true with VS°(h;E) = sing(0) then V(h3E) = 0. 

Note that if A,B are sets, A+B denotes the set of a+b for all a € A, all b € B. 


Proof: In (7.7) the right side is a union of open sets and so an open set 
G*. F = C\G* closed and lies in a compact set, and so is compact. If y € F 
then y EXVS°(h;E) so that there are a positive function s (x) on E and 

an open neighbourhood G(y) of y free from points of VS(h;6 3E). The Gly) 
cover F, so that a finite number cover F, say Gly, ),-.-.G(y,). Let 


6* = min(s, seess ) > 0. Then Fn VS(h;6*;E) is empty, proving (7.7). 
1 P 


In (7.8), VS(h;6*;E) c G, so that S(h35*5E,) c G for each partial set 
Ey of E, and h integrated to 0 on Ey. Hence V(h3;E) = 0 by Theorem 5.4 (5.12). 
The converse is easy. 


Corollary: It follows from (7.8) and Theorem 5.3 (converse) that h zs 
integrable to H on the partial sets of E if and only if H ts finitely 
additive, VS(h-H;6;E) lies in a compact set for some positive function ô on 
E°, and VS°(h-H3E) = sing(0). 
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Theorem 7.4: If E, ts a proper partial set of E and h a brick-point function 


on pe then 
(7.9) VS(h365E,) +VS(h363ENE,) c VS(h;6;E), VS“ (hsE,) +VS°(h3ENE, ) <W(hsE). 
(7.10) If (7.6) is true, there is equality in the second result of (7.9). 


Proof: In (7.9), putting E, = EE}, if yj € VS(h;6;E;) (j = 1,2) then 
yY = y4*Y9 is a sum for a 6-fine partial division of E, which gives the 
first result. For the second we use the continuity of addition, so that 


if X and Y are sets in the value space, X? + Y° c (X+Y)°. 


(7.11) VS(h3E,) + VS°(h3Ey) © VS(h363E,)° + VS(h363E,)° = 
{VS(h363E,) + VS(h365E5)}° c VS(N38sE)°, 


Taking the intersection for all positive functions 6 on EP finishes the 
proof of (7.9). 

If (7.6) holds, then as 0 € VS(h365E,) (j = 1,2) we can replace E by E} 
in (7.6). 


C= (Fo ¥*. 


Lemma 7.5: If X oC, then Cay 
Proof: Letw e€ (X + ye: If there are no x € X, y € Y such that x+y = w, 
then there are (x;) E Ka (y;) c Y such that xj+y} € X+Y and Wj = Xj*Wj >W. 
As (x,) c C, the sequence has a limit-point, say x, and a subsequence of 
(x5) tends to x. The same subsequence of (y;) = (w5-x5) tends to w-x, and 
xX € xc, w-x € ye, we KE g aA proving the lemma. 

Returning to the theorem, let 5p be the positive function of Theorem 
4.6 (4.7) and let ô< Sp on E} UE, = E. By Lemma 7.5 and the proof of 
Theorem 4.6, the second and third signs c can be replaced by =, so that if 
w € VS(h36;E)° for all 6 then w = x, (8) + xs (6) where x; (8) € vS(h3653E5)° E Ga 
The collection of x,(6) for all positive 6 on E°, has a limit-point 
x, € vS(h353E, )° for all positive & on E and a sequence of x, (6) tends to 
X4: By continuity of addition the corresponding sequence of x (6) tends to 
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WX) in VS(h36*3E,) for all positive 6* on A Hence the result. 
For a while we fix h and E and vary the X alone, so that for simplicity 


we can put VS(X) for vs°(h3E3X). Let (Y) be a sequence of sets in the 


<% œo 
value space R or C. Then the symbols ZŁ Y; and Z Y3 are defined as 
j=1 j=1 
follows. 
<% m œ œ k 
EY.={2 ya : ys€Y¥s, J=l,....m, for m=1,2,...}, 2 Yee Z Ya? 
j=1 J j=l J J J j=1 j=1 


Theorem 7.6: (7.12) If X¢T CR" then VS(X) c VS(T). 
(7.13) Let (xj) be a sequence of sets in R" with union X, let (C;) 


be a sequence of compact sets in the value space, and let (85) 


me: 3 č 
be of positive functions on E`. 


-6.: aM . ts hoes 
(7.14) rf VS(hs j3EsX;) EG (iS kea 


(7.15) then VS(X) c = VS(X.). 


Results (7.12), (7.15) are the respective analogues of Za, 5.4 and Theorem 
5.6. 


Proof: (7.12) is trivial. Using it, we can assume that the K; are mutually 
disjoint. By (7.14) and Theorem 7.3, given € > 0, there is a positive 
function i on EF with 


sOn PESK f ; 
(7.16) VS(h; j 3E3X 5) = vs(x) + G; 


where Gj is the closed sphere centre 0 and radius e.2%. Take 


IA 


S(x) < 55*(x) (x € X 5) (x) < 1 (x ENX). 


If Q is a 5-fine partial division of E let Q; be that part of Q with 
xE Xj. As Q has only a finite number of brick-point pairs there is a 
greatest integer m with Qn not empty. Then, using (7.16), 
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m m m 
(Q) E hx(X3x) = 2 (Q,)E h € Z VS(h36.*3E3X.) c Z {VS(X.) +G.} = 
neat Pa j Gf SS a j j 
J 1 J 1 4 4 
m m m 
= E VS(X.) + Z Gc X VS(X.) +G, 
ja ee ee o 
<% © 
VS(h3S3E3X) = E VS(X.) + Ga, VS(X) cF +G,F= 2 VS(X.). 
-- j=1 J 0 — 0 j=i x | 


If y €XF, there is a sphere S centre y and radius some 2e > 0, that is free 
from points of F and then every point of F + Go is at most within e of the 
boundary of S and so a distance at least e away from y. Hence VS(X) c F, 


proving (7.15). 


Ex. 7.1 We do not always have Xe g yF (x+7)°, For let X be the set of 
negative integers and Y the set of points n + 3 + 1/n (n = 3,4,...) Then 
x = x, Yo = Y and (X+Y)° contains 3, which is not in X° + Y°. 


Ex. 7.2 (P.J. Muldowney) The analogue for variation sets of Theorem 5.8 
(5.32) could be as follows. If VS(h363;E) lies in a fixed compact set for 
each positive 6 on E if G is an open neighbourhood of the origin, and if 
the positive function 6, on E° satisfies VS(h38,3E) £ vs°(h3E) + G, then 
VS(h38,3E,) © vs" (h3E,) + G for all partial sets E} of E. However, the 
following example shows that the result is false. Let > 


v-u([u,v) c [1,2)), 
h(Cusv)sx) = f u (0<u<1, vel), 0<6(1) = 6<1, 0<6(x) < [x-1 (x#1) 
0 (otherwise). 


then VS(h363C0,2)) = C0,2), VS°(h3£0,2)) = [0,2], 
vs(h363(0,1)) = (1-6,1) u sing(0), VS®(h;[0,1)) = pair (0,1). 
Taking G = (-e,e) with 0 < e < 6 we contradict the supposed analogue. 


Ex. 7.3. We need the compact set in Theorem 7.3 (7.7). For if in R, h([0, 
1/n),x) = n (n = 1,2,...) with h = 0 otherwise, then V(h;6;[0,1)) = + œ, 
VS(h363[0,1)) is unbounded, and yet vs? (h; [0,1)) = sing(0). 
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CHAPTER 3 
LIMIT THEOREMS FOR SEQUENCES OF FUNCTIONS 


8 Monotone Convergence 


Chapters 1 and 2 lay the foundations on which we build more profound 
properties of the integral. Exx. 4.8, 4.9 show that in this section we 
cannot consider the most general monotone sequence of brick-point 
functions. Instead we use a non-negative brick-point. function multiplied 
by a monotone increasing sequence of point functions. We first have a 
weak form of the monotone convergence theorem which in Lebesgue theory 
is due to Levy. 


Theorem 8.1: For E an elementary set, k(I,x) 2 0 in E°, and (F 5(x)) 
bounded and monotone increasing in j, so that f(x) = lim f.(x) exists 
and is finite, for each x € EC, if £.(x)k(I,x) is PIF sa A over E to 
H; (E) (j = 1,2,...) and if the sequence (H.(E)) is bounded above with 
supremum H(E), then f(x)k(I,x) is integrable over E to H(E), i.e. 


(8.1) | lim fj(x)dk = 1im | f.(x)dk. 
E j>% j> Jg 2 
Proof: By Theorems 5.1 and 4.4 (4.5) the integral H, of f.k exists, 


monotone increasing in j, for each partial set of E. As (H; (E)) is 
bounded with supremum H(E), given e > 0, 


(8.2) R(E) -e< Hy(E) ¢ H(E) 


N 


for some integer N. By Theorem 4.4 (4.4) we replace f; by fi-fi 2 0, so 


that without loss of generality we assume that f; 2 0. By Theorem 5.3 


(5.6), for a positive function 8; on Ec, 
(8.3) (D) o|f;(x)k(I.x) - HD] < gpn 


when the division D of E is sj-fine (Cf S ET T 


Suppose that a function m(x) taking integer values not less than N, 
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is on E°. Define §(x) = Sin(x) %? > 0 and let D be a 6-fine division of 

E using (I,x), with u and v the least and greatest, respectively, of the 
corresponding values of m(x). Here it is essential that D has only a 
finite number of (I,x) so that v is finite. Then v2u2N. As the 
integral is finitely additive (Theorem 5.1) and by (8.2), the monotonicity, 


and (8.3), 


H(E)-e<H (E) = (D) zH (1) < (0) 5H (1) < (D) 2H (1) =H (E) < H(E), 


m(x 
(8.4) H(E) - 2e < (D) fpf) (X)K(1,x) < H(E) + ©. 


This works for every such m(x) > N. We use two. First let r(x) be the 
least integer greater than N for which fay (x) > 0. If no such r(x) 
exists, f.(x) = O(j = 1,2,...),f(x) = 0, and we define r(x) to be N. 
Also s(x) 2 r(x) is the least integer, even if r = N, for which 


(8.5) f(x) 2 Fe (0) 2 f(x) = ef (yy (x) 


Let 6'(x) = Spgs 82x) = S fs (x) = min(6'(x), 8°(x)). IF 0 


is a 6-fine division of E then D is sl-fine and sĉ-fine, and by (8.4), 
the monotonicity of F500) in j, and (8.5), 


H(E)-2e < (D) = f(x)k(I,x) < (D) Ef qx) (x)K(1 x) + e(D) x fpg) COK) 


<H(E) +e+ e(H(E) +e) = (1 + e)(H(E) + €). 


As e > 0+, the first and last values tend to H(E) and so fk integrates 
over E to H(E). 

This is a rather more general weak convergence theorem than is proved 
in Lebesgue theory, where k(I,x) = k(I) is a non-negative measure. When 
k(I,x) 2 0 is of bounded variation we forget r(x), replacing 
F(x) - efx) (X) by F(x) - e. 

Next is the strong monotone convergence theorem, due to Vitali in 


Lebesgue form, 
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Theorem 8.2: In Theorem 8.1 we omtt the boundedness of (F 5(x)) in j for 
each x € ee proving tt true k-almost everywhere from the boundedness of 
(H; (E)). If X is the set of x where the limit does not exist, we put 


f(x) = Tim F CX)K(NGX), and then f(x)k(1,x) is integrable over E to H(E) 
Jeo 
els 


(8.6) l Tim £.(x)X(XX3x)dk = Tim | f.(x)dk. 
E jroo joote J 

Proof: Again we replace f} by f.-f, 2 0, and so can assume f(x) 20. 
For X the set where (5 (x)) is unbounded, and N > 0 a fixed integer, let 
t(x) be the least integer such that 


(8.7) f (x) 2N (x € X), t(x) = 1(x EXX). 


t(x 


Using Theorem 5.3 (5.6), given e > 0 let 8; be a positive function on ES 
such that ` 


(8.8) (D) B/f,(x)k(I,x) - H;(I)] < e.29 


for each 5;-fine division D of E, and put 6(x) = 54 (x) %) > 0. A 6-fine 
division D of E has only a finite number of (I,x), so the corresponding 
t(x) have a finite maximum v. By (8.7), (8.8) and the monotonicity, 


(D) EN.k(1,x)X(X;x) < (D) Efa (x)k(I,x) < (D) Heg (1) ke 


x) 
s (D) EH (I)+e= H (E)+e SH(E)+ e, V(k3E;X) s V(k;8;E;X) < (H(E)+e)/N. 


As true for all integers N > 0, V(k;E;X) = 0, and by Theorems 5.7 (5.22) 
and 8.1, 


| Fak = Í FXX; Jae > Í fdk. 
E J E J JE 


It is usual to omit y(XX;.). 
We can now obtain information about absolutely convergent infinite 
series. 
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Theorem 8.3: Let k(I,x) = 0 and let fk and | 5|k be integrable in 


EQ e sticawds ZF 


8.9 h f .(x) | dk 
8) g fe Ha 


(8.10) f(x) = z f(x) 


is absolutely convergent k-almost everywhere and is integrable over E 


with 
(8.11) l Piik F | f,(x)dk. 
E 4 Je 


Proof: First we use Theorem 8.2 and the results 


m m 
| z |f;(x)|dk = z 


f.(x)|dk < 7 f.(x)|dk, 
od Žž, J, If09ldk s z ff 


jel 
given convergent, (8.9). Hence fy (x) is convergent k-almost everywhere, 


where 


© 
fa) = Zi |F;00]. 
and fok is integrable in E, proving the theorem for the [Fyfe If the 
fj are real-valued, the theorem is true for the [f;l + fi 2 0. Sub- 
tracting the result for |f; gives the result for real-valued fji For 
complex-valued fas by Theorems 4.4, 4.5 we deal with the real and 
imaginary parts of fj separately. 
An absolute integral is now proved to be absolutely continuous. 


Theorem 8.4: If k(1,x) 2 0 in E with F(x)k(1,x), |f(x)|k(1,x) and 


k(1,x) all integrable over E, then the integral F of fk is k-absolutely 


3 mC 
continuous in E~ and E. 
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ae tse 


Proof: For N > 0 let fy = min(|f] ,N). Then fy is monotone increasing 
to |f| as N+, and F yk is integrable by Theorem 6.3 as Nk is integrable. 
Hence for N = 1,2,..., 


Lim | fydk = | |f|dk 
p N E 


N> 


by Theorem 8.1, and, given e > 0, there is an integer N such that 
| (|f|-fy)dk < de. 
E 


If X is any set with V(k;E3X) < e/(2N), Theorem 5.4 (5.9), (5.15) with 
Ifl = |f| - fy + fy give 


V(FSE3X) = V(fk;E;X) = V(|f]k;E;X) < V((]f]-fy)ksEsk) + V(fyk;E;X) 
z | (IF| - fyldk + N.V(k;E;X) < €. 
E 
Hence the result. 


Ex. 8.1. Let the graph of fa 2 0 in [-2,2] consist of the two sloping 
sides of an isosceles triangle with base sin(m) < x < sin(m) + m“? and 
height m, together with that part of the x-axis lying in [-2,2] but not 
in the base. Find 


œ 


f dx. Is £ f(x) 

i i m=} ™ 

convergent k-almost everywhere in [-2,2], when k([u,v),x) = v-u? 
(New University of Ulster, 1981, M 313) 


Ex. 8.2. For sequences that are not monotone we consider the following 
example. Let k(({u,v),x) 2 0 in Ee. and tend to 0 when x is fixed at u or 
v and v-u +0. Let g(x) be of bounded variation in E°. If in Ri, falx) 
and k(1,x) satisfy the hypotheses of Theorem 8.2, show that F gk is 
integrable and tends to the integrable fgk as j + œ, where ts > f k-almost 
everywhere. Note that g can change sign. 
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Hint: g is the difference of two non-negative monotone increasing 
functions g4 292 and so gK > 0 (g = 1,2). We are given the existence of 
the integral Kj of f;k. Prove that it is continuous so that gj and so 
gf jk are integrable. Now apply Theorem 8.2 with k replaced by gk > 0. 
Note the use of Theorem 2.3. 


Ex. 8.3. By using Theorem 8.1 prove that 


` jx-1 a . jx-1 d 
es leas se ie je DoT 


and hence show that zl log. > O0as j>. 


Ex. 8.4. If k(I,x) 2 0 is integrable over E and f(x) a real-valued point 
function in EC such that fk and |f|k are integrable over E, prove that 
g(p,q;x) = max(min(f,q),p) is integrable when multiplied by k, where 

p < q are constants, that h(p,q;x) = (g(p.q3x)-p)/(g-p) is monotone 
decreasing as p + q- with fixed q, with limit x(X3;x) the indicator of 
the set X where f(x) > q, and that x(X;x)k(1,x) is integrable over E. 


Hint: (|f| + |p| + |q|)k is integrable. If f(x) = q, h(p,q3x) = 1. If 
f(x) < p, h(p,q;x) = 0. Thus as p + q-, h(p,q;x) + x(X3x). Next, if 
p<r<qand f(x) z r then g(p.qsx) = g(r,q;x), h(r,q;x)-h(p,q;x) = 
(r-p)(g(p.q3x)-q)/{(q-r)(q-p)} < 0. If 


p < f(x) < r, g(psasx) = f(x) and g(r,q;x) = r, 
h(r,q;x) - h(p,q;x) = (p-f(x))/(q-p) < 0. 
Now use Theorem 8.1 with a suitable sequence of p. 


9 Bounded Riemann Sums And The Majorized (Dominated) Convergence Test 


When the value space K of the brick-point functions used in the theory, 
is not the space of real or complex numbers (R or C) it is usual to consider 
continuous linear functionals on K with values in R, and try to deduce 
results for K from results for R. There is then a difficulty in applying, 
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say, the monotone and majorized (dominated) convergence tests as it is 
often difficult to see what precise conditions in K correspond to the 
required conditions in R. However, it is easy to transfer theory 
involving compact sets and Riemann sums from R to a K having suitable 
algebra and topology, which is one reason why the present section appears. 
Another reason is that it is near to one set of necessary and sufficient 
conditions for the sequence of integrals to tend to the integral of the 
limit function, this last integral existing, see Section 11. 


Theorem 9.1: For E an elementary set, and f.(x) real-valued and k(I,x) 2 0, 
both on EĈ, let f jk be integrable to H; (E) over E, let B be a real number, 
and let the positive function § on E° be such that for all §-fine 

divisions D of E and all choices of functions j(x) taking positive 


integer values alone, 
(9.1) (D) EF 5 (py (xk (Lx) 2 B. 


If lim inf H.(E) 2s finite then (lim inf f.)k is integrable over E and 
J> œ j +o 


(9.2) | lim inf fdk < lim inf | f dk = lim inf H,(E). 
E 


E j+o j>% j> œ 


If (9.1) ts replaced by 
(9.3) (D) ZF sy) kx) <s B 


for the same §,D,j(x), with lim sup H, (E) finite, then (lim sup f 5)k is 
j>% j> œ 
integrable over E and 


(9.4) | lim sup f.dk = Tim sup | f jdk = lim sup H,(E). 
i i ; i 


E j+o J >+ œo J> œ 


If f} > g (finite) k-almost everywhere, as j > œ, with (9.1), (9.3) replaced 
by 


(9.5) B < (D) E fjg) Kx) came 
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for some real numbers B < C and for all choices of positive integer valued 


functions j(x), then gk is integrable over E to H(E) = lim Hj (E), which 
J>% 
limit exists and ts finite, i.e. 


(9.6) | (lim f.)dk = lim | f.dk. 
E jae 4 jiss JE 9 


Proof: From Theorem 6.3 and (9.1) or (9.3) we have the integrability of 


min(fy(x) s. ++ sf q(x) )k(15x) (Q>N20). 


By (9.1) the following exists and is finite, and by Theorem 8.1 it, times 
k, is integrable over E, namely, 


inf fe = lim min(f,,...sf,)s 
jaN | Q>% N Q 


monotone decreasing in Q. By Theorem 8.2, as 


lim inf f. = lim inf fs 
j >œ% Noo j2N 
monotone increasing in N, the lim inf, multiplied by k, is integrable 
over E provided that H = lim inf H.(E) is finite and so (H; (E)) bounded 
j>% 
below. For 


lim inf H,(E) = lim lim min | f.(x)dk 2 Vim lim | min #;(x)dk 
J ape Neo Q»œ N<j<Q JE 9 N-sco Qsoo JE Nsj<Q 


= Tim | int #,(x)dk, = l tim inf t Gidk 
E j>N JE Jag 


N> 
since the previous integral is bounded above by H. Hence (9.2). Then 
(9.4) follows from (9.3) and (H; (E)) being bounded above, by putting ef 
for f. in (9.2). For (9.6) from (9.5) we use the other two results, noting 
that from (9.5), (H; (E)) is bounded, so that 
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gdk = | (lim inf f.)dk < lim inf | fjdk s lim sup | fdk 
E j E 


E j æa j> œ J> œ 


Je 


g | (Tim sup f;)dk = | gdk, giving (9.6). 
E j JE 


J> œ 


Note that in (9.5) the set of values of sums lies ina compact set, and 
this is easily translatable to more general value spaces. 
We can now prove the majorized (dominated) convergence test. 


Theorem 9.2: Let f.k, pk, qk all be integrable in an elementary set E with 
k(I,x) > 0, and Fj 2P.q all being point functions, in E°, if ps fj for 
all j with finite 


lim inf | f.(x)dk, then (lim inf f.)k 
jae JE I cor 
ts integrable over E and (9.2) ts true. On the other hand, if t; <q 


with fintte 


lim sup | f.dk, then (lim sup f.)k 
yd j 


J > œ% J> œ 


is integrable over E and (9.4) is true. If p< fj <s q for all j and 
fj > g, finite, k-almost everywhere, as j > œ, then gk is integrable and 
(9.6) is true. Note that in Lebesgue integration theory, the first part 


ts called Fatou's lemma. 


Proof: Given ¢ > 0, there is a positive function § on E° such that for 
all §-fine divisions p of E, 


| (0) zp(x)k(1,x) a pdk] < e, 
E 
and so in the first part, 


f, pdk = e < (0) EPOOK(I 4) < (0) z fjg OKC), 


proving (9.1). Thus Theorem 9.1, first part, gives the result. The rest 
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of the proof is as for Theorem 9.1. 
An extension of Theorem 9.2 is as follows. 


Theorem 9.3: Let p;(x) > p(x) and ajx) + q(x) as j + œ, k-almost every- 


where in Ec, where k(I,x) > 0, such that Pjk» pk, qjk> qk are all 


integrable over E and as j > œ, 


l p.dk + | pdk, l q.dk + | qdk. 
EJ E EJ E 


Then we can replace p,q by P54, in Theorem 9.2. 


Proof: In the first part, Pj s fs means that frp 2 0. Proving the 
first part for fap we then have it for fy by adding Pj. since 


l lim inf (f.-p.)dk = | lim inf f.dk - | pdk, 
E j JJ E J E 


J> œ j +o 


lim inf | (f.-p,)dk = lim inf | fdk - | pdk. 
E Jd + owe JE 9 JE 


or 


The tests involved in Theorems 9.2, 9.3 basically belong to absolute 
integration, as is shown by Theorem 9.3, in that Fa Pn or faj is an 
absolute integral. There is a similar restriction of Theorem 9.1, last 


part. 


Theorem 9.4: If F(x) and k(1,x) satisfy the conditions of Theorem 9.1, 
last part, for E°, then (fF 5-f,)k ts absolutely integrable on E. 


Proof: Let D be a 6-fine division of E, where § is as for (9.5). First 
take j(x) = 1 (all x) and then take j(x) = j when F(x) -Fy Ox) > 0, say in 
X, and j(x) = 1 in XX. The modulus of the difference of the two sums 
satisfying (9.5) is at most C-B, i.e. 


A 
(op) 
1 

w 


(oz |f (x) Fy (x) [XO x) (Tx) 


Similarly 
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(D)z F (x) - f4 (x)[X(SX;x)k(.I,x) < C-B 


(D)z F(x) : F, (x) [k(T,x) á 20: = 2B. 


and (F5-f, k is of bounded variation on E. Hence by Theorem 5.4 (5.13), 
(5.14), (f.-F,)k and its modulus are integrable on E. 


Clearly we can replace fifi by fix) f or Taf stg 


Ex. 9.2 shows that sometimes the conclusion of Theorem 9.1 is satis- 
fied without (9.5) being true. Thus (9.5) is not necessary for (9.6). 


Ex. 9.1. If f5(x) = 5/4 1452(gx-1)?} prove that for all x, lim f.(x) = 0 
jee 

and that by the calculus, lim | p 

joe (0,1) 9 


If in this example [m] denotes the integer part of m, prove that for x > 0, 


tex exix™'y <1, 0 s(xtx™'y - 1)? < x2, 


Hence taking j = [x71], show that 
sup f (x) > R[x] 2 l-1) (0<x¥ 1) 
J 


and that if g(x) 2 F(x) for all j then g cannot be integrable in [0,1). 
(New University of Ulster, 1980, M 313) 


Ex. 9.2. Let the finite-valued point function g on ES be integrable when 
multiplied by k(I,x) 2 0 over E, but not absolutely integrable. Then as 
j >œ, fj = g/j tends to f = 0 at all points of ES while 

| fak = + | gdk + 0, 

EJ SE 
and yet fj-f} = g(1-j)/j is not absolutely integrable for j > 1. By 
Theorem 9.4 this example does not satisfy (9.5), so that (9.5) is not 
necessary for convergence under the integral sign. 


100 


Be. 9.3. Let n= 1, E= [0,1), fj(x) = x73/2 for (3+1)! <x < a7! and 


fi = 0 otherwise. Then f = 0 and the integral tends to 0. Ifg > fy 
for all j, g is not integrable. 


Ex. 9.4. Let g(x) be of bounded variation on EC and let fx) and k(I,x) 
on EC satisfy Theorem 9.1 (third part). Prove that f Oda (x) and k(I,x) 
satisfy that theorem. 


Hint: See Ex. 8.2. 


Ex. 9.5. For G a bounded open set on the complex plane with boundary a 
simple closed contour S traced out by a complex-valued function z(x) 
when the real variable x goes from 0 to 1, let (f ;(z)) be a sequence of 
functions of z with limit f(z) as j >œ. Then we can define the integral 
of fh round S to be 


p f (zdz = on f (z(x))az(x). 


Let fj(z(x))z' (x) and k([u,v),x) = v-u satisfy Theorem 9.1 (third part). 
Let T c G be a closed contour, so that TNS is empty, and let 


falz) 


pa j PRS f(z) 
g;(t) = Oni k ea” dz g(t) = Ont p zt dz. 


Noting that (z-t)7! is of bounded variation for the z € S when t is fixed 
on T, prove that gj and g exist and are regular (complex variable sense) 
on T and dj > g uniformly for t € T. Thus the full power of limits under 
the integral sign is only needed on the boundary S, the situation is 
simpler on T. 


Hint: Use Ex. 9.4 and Vitali's convergence theorem from complex variable 

theory, for a contour between S and T. Vitali's theorem is as follows. 
Let (f ;(z)) be a sequence of functions, each regular in a region D. 

For a fixed constant M, every integer j, and every z € D, let 

|F;(z)| <M. Further, let F(z) > f(z) as j >œ, at a set of points z 

having a limit-point inside D. Then f(z) + f(z) uniformly in any region 

bounded by a contour interior to D. 
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10 Controlled Convergence 


Another way of dealing in one dimension with limits under the integral 
sign uses extensions of definitions given in Section 5 above Theorem 5.10. 
When h depends on a parameter j, we say that h. is k-AC* in X, uniformly 
in j, if V(h; : E3Xq) + 0 uniformly in j when V(K3E3Xq) >» Oand X gX. 
Then the definition of k-ACG* uniformly in j, follows easily. When k =m 
and m([u,v);x) = v-u, we drop k- from k-AC*, etc. 


Lemma 10.1: (101) Zet (f.(x)) be a sequence of real or complex valued 
point functions defined on [a,b] that converges almost everywhere in 
[a,b] to a finite f(x) as j + œ», where each fym is integrable to F (1) 


on all intervals I c [a,b). 


(10.2) Let Fi be AC* in [a,b) uniformly in j, so that the Fi are 


absolute integrals, 


(10.3) Then V(Fj-F 5 [a,b)) + 0 as j,p > œ. 
(10.4) Further, let F} >F as j +o. 


(10.5) Then V(F - E [a,b)) > 0 as p >o. 
Proof: By Theorem 5.7 we can assume that f; > f everywhere in [a,b]. 
For if ~X is the set where f, does not tend to a finite limit, we replace 


fj by f; X(X:.). Given ¢ > 0, let Yp be the set of x where 


(10.6) fj) - FO)| <e (all j 2 p). 


Then Yp) is monotone increasing with limit [a,b], and by Theorem 5.11, 
V(m3 Cab) 3¥,) > V(m;[a,b))(p > œ); V(m;[a,b);5Y,) > 0 (p >œ) 

by the measurability results of Section 16. Using (10.2), 

(10.7) V(Fj;[a,b);>Y ) < e (P > Pole)), 


uniformly in j. This uniformity needs (10.2), though for fixed j, Theorem 
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5.11 gives the result directly. Further, 


V(Fj-F,sfa,b)) s vf (fif dm; [a,b)s¥,) + V(F,-F,3Ca,b)ss¥,) 


p 


< VOCE 5-# m; [a,b);Y_ ) + 2e < 2e(b-a+1) (p 2 Pgle)), 


p 


giving (10.3). We have used (10.6), (10.7) and Theorems 5.6, 5.4 (5.9). 
Using (10.4) let j >œ. As Fj is finitely additive and so [F -F l is 
finitely subadditive, then for each division D of [a,b), 


(D)Z|F-F | = Vim (D)z|F 5-F | Slim inf V(F;-F3Casb)) $2e(b-a+t)(papg(e)), 


je j>% 


V(F-F 5 [a,b)) < 2e(b-a+1) (p2 Pg(e)) 
and (10.5) follows. 


Lemma 10.2: If we assume (10.1) and replace (10.2), (10.4) by (10.8), 
(10.9), 


(10.8) the F; ([a,x)) converge uniformly on [a,b] to F([a,x)) (a <x <b); 


(10.9) the Fj are AC*(X) uniformly in j where X is a closed subset of 


[a,b], 


(10.10) then V(F 5-F) 3 [a,b);X) > 0, V(F-F 30a 4b) 3X) > 0, as jp ra 


Proof: Given the finitely additive function F. of partial sets of [a,b], 
we define the point function 6) on [a,b] by G(x) $ F ;(La,x)) (x € X), and 
on the intervals (u,v) of (a,b)\X with u,v € X we take 6; linear from 

G. (u) to G; (v). By (10.8), (10.9) the Gj are absolutely continuous 


J 
uniformly in j, while foru sr<s<v, 


G(s) - G(r) = | jdm, g5(x) = Fj(Cusv))/(v-u) (u <x < v), 


[r.s) 


and by (10.8) this gj tends to F([u,v))/(v-u) everywhere in (u,v). If 
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r< s, both in X, 


G(s) - G;(r) = Fj (Crys) = lesa f an, 


so that if also g. = f, in X, g, tends to a finite limit almost everywhere. 


Hence by Theorem 5.4 (5.9) and, Lemma 10.1 (10.3), 
V(F 5-F,sLa sb) sx) = V(F m-f ms [a,b)3X) = V(g 5m-g.ms [a,b) 5X) 
= V(AG;-AG, 3 [a,b);X) < V(AG;-AG, sLa,b)) > 0(3,p > @). 
Similarly the second result follows from Lemma 10.1 (10.5). 
Theorem 10.3: If (10.1), (10.8) hold, and 
(10.11) the funetions F; are ACG* uniformly in j, 
(10.12) then fm in integrable on [a,b) and 


f.dm > 


lew J ore eee 


Proof: Let (Xg) be the sequence of closed sets used in the definition 
of ACG*, let 


p 
Z = U X, W =Y AZ; 
aP p p P 


where Y_ is as in Lemma 10.1 (10.6). By Lemma 10.2 let r(q) be the 
integer such that 


(10.13) V(F,-Fs [a,b)3X,) < €.2°9 (all r 2 r(q)), 


given £ > 0, and take s = max(r(1),...,r(p)). Let a, be a positive 
function on [a,b] such that 


(10.14) V(F,-Fs8,50a4b)sX_) < WF -F3[a,b);X,) + 6.279 < 6.2179 
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and let the positive function § on [a,b] satisfy ô < 8, (q = 1,...,p) and 
be such that for all 6-fine divisions D of [a,b). 


(10.15) ()E| F(x) (v-u) - F.(Lu.v))| <e. 
From (10.6), (10.15), then (10.13), (10.14), 


(D)2| f(x) (v-u) - F(Lu,v))|x(Wo3x) < (D) 2] F(x)-f 


p (x)| (v-u)x(Y 3x) + 


S 


+ (D)E|f (x) (v-u) -F (Cu,v))| + (D)z|F_(Tu,v)) ~F(Lusv))|X(Z, 3x) 


p 
< e (b-a) +e + © V(F.-F38_3 [a,b)3X.) < e (b-a+3), 
gi smg q 


V(fm - F; [a,b); W) < e(b-a+3). 


As p > %, Wo is monotone increasing and tends to [a,b], so that by 
Theorem 5.11, 


V(fm - F; [a,b)) < e(b-a+3). 


This is true for all e > 0, so that fm-F has variation zero on [a,b). 
Now f is the limit of fj and F is the limit of Fas which are finitely 
additive. Thus F is finitely additive and so is the variational integral 
of fm, which is equivalent to the (generalized Riemann) integral by 
Theorem 5.3, giving the final result of Theorem 10.3. 


11 Necessary And Sufficient Conditions 


In this section we look for necessary and sufficient conditions for 
a sequence of integrable functions to tend to an integrable limit, and 
for the integrals of members of the sequence to tend to the integral of 
the limit function. In more detail, (11.1). Zet E be an elementary set, 
let the real-valued k(1,x) of bounded variation be defined in Ec, and let 
fj: EC > R with fik integrable in E (j = 1,2,...) such that f(x) > f(x) 
for all x€ Ec, ga J FA 


This basic condition is not too restrictive since if the convergence 
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of f. to f is only known k-almost everywhere, it is then everywhere except 
in a set X with V(k;E;X) = 0, and we can replace f} and f by F 5x and fx 
where x is the indicator of the complement of X. The replacement does 


not alter the integrability of fk, fk, nor the values of the corresponding 


integrals if they exist. 
We then look at two properties, 


(11.2) the integrability of fk over E, 


(11.3) lim | Fak = | fak = | lim fdk. 
E E 


j>% E jo 


Theorem 11.1: Given (11.1), a necessary and sufficient condition for 
(11.2) is that 


(11.4) (D) E fpf) COK) € S, 
for some compact set S of arbitrarily small diameter, some positive 
functions M, 6, on Ec, all positive integer valued functions m 2 M on E 
and all -fine divisions D of E. If the diameter condition on S is 
omitted, the weakened condition is not always sufficient. Note how 


close (11.4) is to (9.5), which has no diameter condition. 


Proof: As j > œ, fi tends to f everywhere in EF, Thus, given e > 0, 
there is a positive function M such that 


(11.5) Ifj - f(x)| <e (all J > M(x), all xe Ec). 


If fk is integrable to F on E, there is a positive function 6 on ES such 
that 


(11.6) Fre < (D) Df(x)k(1,x) <F+e 


for all -fine divisions D of E. Hence using (11.5), (11.6), 


F-e(1+V) < (D)EF a(x) K(x) <F+e(1+V) (m(x) 2M(x), V=V(k;8;E)) 
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and we have (11.4) with S of arbitrarily small diameter, 

(11.7) S = [F-e(1+V), F + e(1+V)]. 

Conversely, letting m(x) tend to infinity at each point x, (11.4) gives 
(11.8) (D) 2 F(x)k(I,x) ES 


as S is compact. If S has arbitrarily smal] diameter by choice of M, 4, 
as in (11.7), then (11.8) shows that fk is integrable in E. But if we 

do not assume the diameter condition on S, then (11.8) is not always 
sufficient, see Ex. 11.1 and Theorem 16.10. 


Theorem 11.2: Given (11.1), (11.2), a necessary and sufficient condition 
for (11.3) is that, given e > 0, there are a positive integer J anda 


positive function 5; on E° and depending on j, with 
(11.9) F-  < (0) F5(x)k(I,x) <F +e, F= [ tak, 
JE 


for all Sj-fine divisions D of E and all j 2 J. 


Proof: We are given that the integrals F; and F of f.k and fk, respect- 
ively, over E exist. It is thus necessary and sufficient that, given 

€ > 0, there are a positive integer J and a positive function F on ES 
and depending on j, with 


F- e< Fj <F +e (all j 24), 
and so (11.9), leading to a compact set [F-e, F+e] of arbitrarily small 
diameter. 
The necessary and sufficient conditions given in Theorems 11.1, 11.2 


are generalized to more general convergence in Section 13, and then 
specialized to particular cases in Sections 15, 18. 


Be. 11.1. If fy = 1 in [0,1) on the real line, f; = (-1)9q(q+t) in 
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2-1/q < x < 2-1/(q+1) (q = 1,2,...,5), and f, = 0 in 2-1/(j+1) < x < 2, 
then f, is integrable over [0,2). The limit f as j + œ satisfies f = 1 

in [0,1), f = (-1)%q(q+1) in 2-1/q < x < 2-1/(q+1) (q = 1,2,...), F(2) =0. 
In the respective intervals take a function M satisfying M> 1, M> q, 
with M(2) > 1. If mè then Tie = f. For suitable § > 0 the sums 


oscillate between 0 and 1, while 


J 
| fdx = 1+ z (-1)9 
[0,2-1/j) q= 


and does not tend to a limit as j + œ, and f is not integrable over [0,2) 
since the integral has the Cauchy limit property, see Theorem 5.14. 

Note that this example does not satisfy (9.5) in Theorem 9.1 since 
by suitable choice of j(x) the negative terms can be replaced by 0, and 
the positive terms give unbounded sums. 


12 Mean Convergence And LP Spaces 


Let p be a positive real number. Then for integration over the 
elementary set E in R’, 


(12.1) N(f.g), = | |f(x) - g(x)|P dh 
E 


is a metric when p < 1. When p > 1, N(f,g) 1/P is a metric. These 
metrics can be used for ‘distances' between the functions f and g in 
suitable spaces of functions, or function spaces, which may explain the 
usefulness of Theorem 12.1. 


Theorem 12.1: Zet h(I,x) 2 0, let (f.(x)) be a sequence of point functions, 
both on E°, and let the NCF ssf), of (12.1) exist for every pair j,q of 


posttive integers. Suppose that, given ¢ > 0, a ple) satisfies 


(12.2) N(f55fq)p < e (all jaq 2 ple)). 


Then there are a point function f and a subsequence j(r) of the positive 


integers such that 


108 


(12.3) f(x) = lim F 5(p) (x) finite, h-almost everywhere in E°, 


r>œ 


and for each positive integer j 2 ple), the integral below exists and 
A Nif ist). © Eu 
(12.4) (f; la € 


(12.5) If g satisfies the condition for f in (12.4) then f = g h-almost 
everywhere, a uniqueness result. 


The older notation is that a sequence (f) satisfying (12.2) is 
said to converge in mean with index p, and one satisfying (12.4) is said 
to converge in mean to f with index p. The more modern notation notes 
that N(f.g), or N(F,g),1/P is a metric, and calls the first sequence 
fundamental or Cauchy, and the second sequence, convergent to f, under 
the given metric. The theorem says that a fundamental sequence is 
convergent, so that the corresponding function space is complete. 


Proof of theorem: When p # 1 we cannot use Theorem 8.3. Pe we choose 
a strictly increasing sequence (j(r)) with N(F55fQ)5 22 eh? (j,q2d(r)). 
Let Xa be the set where 


-r 
(12.6) Ifj) - fir) S205 


By Theorem 5.4 (5.13), 


A 


2 YPV(HsEsX,) < W(|F5(_)(X) - F5 (p44) |Ph(1.x) 5) 


-erp “yp 
= ; 2 5 3EsX as 
NCF5 (ny oF 5(H41))p < V(h a) < 
If x ¢yY, = ùu X, then 
M jem J 
s E ps -M+1 
-f. oF mers, 
Re Ifj > fjera < may 
and there exists 
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Eu o a = Vin tg 


say. Thus f(x) exists in XX where 


X= A Ys V(HSESX) S V(HSESY%y) § o-MP 7(4-27P), V(h;E;X) = 0. 


Thus f exists h-almost everywhere and Fatou's lemma (Theorem 9.1 (9.2)) 
gives 


| If s(x) - #(x)|P dh $ lim inf i I 5(x) £5 (py (x)? dh < e(jzp(e)), 


rz oe 
and (12.3), (12.4). If also, for every e > 0, 
f. If; - g(x)|P dh < e (j 2 ale), 


then Fatou gives 


| |#(x) -g(x)|P dh < lim inf | IF 5 (x) -g(x)|? dh < 
E r>o gp d(r 


|. if(x) - g(x)[P dh = 0, 
and Theorems 5.4 (5.13) and 5.7, converse of the first part, give 
V(J#(x) - g(x)|P h(I,x)sE) = 0, F(x) = g(x) 


h-almost everywhere, and f is unique, modulo h-null functions. 


We now use Theorem 2.9 to prove some elementary inequalities involving 


the Young functions o, ¥. At this point we extend Theorem 6.5 slightly, 
assuming that f is such that max {m-1, min(f(x),m)} h is integrable for 
each integer m. We then say that f is h-measurable. The proof of the 
extended Theorem 6.5 still goes through, on modifying the sets W; slightly 
and using a sequence (Him) instead of a fixed H,. Thus if suitable 
Riemann sums are bounded we can take r(x) = KA (x), ¥(x), for Fs 2 0s ie 
obtain integrability results. From the first, as fg = xf (fag)? - (f- g) 
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to obtain the integrability of fgh when f and g are h-measurable and the 
corresponding Riemann sums are bounded. 


Theorem 12.2: Let h(I,x) 2 0, f(x) 2 0, g(x) 2 0 on E°, f and g being 
h-measurable, and let a,b be positive constants. By Theorem 2.9 (2.22), 


(12.7) 4 o(af)dh + k ¥(bg)dh = ab [. fg dh, 


assuming that the first two integrals exist, with equality tf and only 
of 


(12.8) — o(af(x)) 


HA 


bg(x) < o(af(x)+), h-almost everywhere. 


If for a = ags b = bo, the left side of (12.7) ts 1 then 


-1, -1 
(12.9) hi fgdh < ag by > 


Proof: (12.7) follows from the extended Theorem 6.5 and Theorem 2.9 


(2.22) in the form 

p(x) = &(af(x)) + ¥(bg(x)) - abf(x)g(x) 2 0 
If equality in (12.7), then for each integer m > 0, the set where p(x)21/m 
has h-variation zero, so p(x) = 0 h-almost everywhere. When p(x) = 0, 
(12.8) follows. Conversely, (12.8) gives p(x) = 0, h-almost everywhere, 


and equality in (12.7). (12.9) is obvious, with (12.8) for equality. 
Ex. 2.5 gives an important special case, with 


a(x) = xP/p + ¥(y) = y/q (p> 1, 1/p + t/q = 1), 
(aP/p)N(#,0), + (b4/q)N(g,0), 2 abN(Fg,0),. 


-1 RA 1/p b -1 
If N(F,0), > 0, N(g,0), =O ay = N(¥,0), > bo q 


inequality 
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= N(g,0) 1/4 Hölder's 


(12.10) N(fg,0), < N(#,0),'/Pn(g,0),179, | 


E Ẹ 


follows, with equality when, h-almost everywhere, 
(12.11) aPoT Pol = bg, aPfP = b4g9, fP/g9 = constant. 


The inequality is stil] true when N(f,0). = 0, as f = 0 h-almost everywhere 
and N(fg,0), = 0. Similarly when N(g.0), = 0. Replacing 1/p by t, 1/q by 
1-t(0< t< 1), f’ by f, g1 by g, 


(12.12) l ft! tdh < (| fah |? (| gah! È (f 2 0, g 2 0) 
E Ue 2 Ne” 7 

follows from (12.10), with equality when f/g is constant h-almost every- 
where, the left side existing by Theorem 6.4 when the integrals on the 
right exist. 

We denote by LP the space of all h-measurable functions f on ES for 
which N(|f|,0), exists, writing [If ll, = N(|#] 0), "P. Note that throughout 
the theory we suppose f finite everywhere. When p > 1+, then q > © and 


(12.13) (| gan) 3 > |[gll, Zess sup {lg(x)| : x € E°}, 

E 
the supremumof all constants b for which the set X, of x where Ig(x)| > b, 
has V(h;E;X,) > 0. We prove (12.13) by an easy inequality, and a second 
since the set of x with |g(x)| > |lgl|,. has h-variation 0, 


bv(hsEsx,)'/4 < (J. 8 g9 an /4 s |igll_v(nse)'/9. 


We denote by L” the space of all h-measurable g with llgll finite, 
and (12.10) is true here with p = 1, "q =". This follows since the set 
of all x with |fg| > If] llgl|,. finite, has h-variation 0. 


Theorem 12.3: (12.14) (Minkowski's inequality). For the usual h 2 0, if 


f.g are in LP, so is f +g, and 
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of! sha eet aaa 
fgdh < g. f dh) (| gdh 1/9, 


If + allp lfll, +Igll, (sp se). 


(12.15) Taking f 2 0, g 2 0, equality occurs when p = 1. When the finite 
p > 1, equality occurs tf and only if for constants a20,b20, 


af(x) = bg(x) h-almost everywhere. 


Proof: The case p = 1 is trivial. When the finite p> 1, f +g is in 


LP if f, gare. For if 


x), L#Cxd+g(x)[P < GEOD] + [glx |)? < C2] F(x) 1)? 


A 


|#(x)| 


IV 
a 
~ 


Lit 


HA 


la(x)|, |#Ox)+a(x)|P < 2P]g(x)]P, 


A 


|f(x)| 
so 


[fO)+g(x)|P < 2P([#(x)[P + [g(x)]P) (a11 x). 


Integrating, f+g is in LP’, and clearly we can replace f by |f], g by |g|. 
Next, as (p-1)q = p, |f| (|f] + Igl)P Íh is integrable over E by Theorem 
6.4 and (12.12). Similarly |g|(|f| + lgl)P Ih is integrable over E, and 
by Hölder's inequality, twice, 


(12.16) | Cll + Lgl Pah = | IFEI DP an + | lalate 


jalPan A (f eslo Pan)" 


s 1 |f Pan) "P (|. (Ifl+igl) )Pan 1/94 1i 


(12.17) If L (lf] + |g|)Pdh = 0, 


then, h-almost everywhere, |f| + |g] = 0, f = 0 =g, and (12.14) is true. 
Otherwise, dividing by the qth root of the positive integral in (12.17), 
(12.16) gives (12.14). With (12.17) false and the finite p > 1, equality 
occurs in (12.14) if and only if both Hilder inequalities are equalities 


and for some constants a,b, 
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[e[P = aX |el+lgl)?, |g]? = b%(|#|+/9|)? h-almost everywhere, and 


a'|#(x)| = b' | g(x) | 
for constants a' 2 0, b' 2 0. , 


The case "p = œ" has to be considered separately. Here, |f] < hal 
gll except in a set Y of 


except in a set X of h-variation 0, and lg| < ; 
h-variation 0, and X u Y has h-variation 0. Hence |f+g| < |lf{|,,+ lallo 
h-almost everywhere, and (12.14) follows in this case. Equality occurs 
if and only if, for each e > 0, the sets X- and Y, where lf] > i 
Ig] > Ilgllo- £» respectively, have V(hsE3X_ N te > Os l l 
Minkowski's inequality shows that the triangle inequality holds in 
P (1 < p <œ) if the "distance" from f to g is given by IIg-Fll,, » and this 
is a norm if f and g are identified when f = g h-almost everywhere. 
Having proved (12.10) when 1 < p < ~, we now show that the upper 


bound is attained. 


3 <2 AIG! a3 
Theorem 12.4: For the usual h 2 0, fixed f in LP, and all g in L” with 
Ill 


sup f fgdh = sup i |fg|dh = fll, lisll (sp se). 


1, we multiply g by sgn”! (fg) where 
1, so that in the integrals, fg can be 


" 


m Without altering IIgIl g 

sgn(x) = x/|x| (x # 0), sgn(0) 

ania? by |fg|, and we can assume f 20, g 20. When p = 1 we take 

g(x) = 1 (all x), so that ||g||,, = 1 and the teaeara of |fg| becomes 
4 p- p 

I|f||,- When 1 < p <œ we take g(x) = fœ) 7 fll, so that 


-1 
lg09]9 = [f091P/ Fil)?» Ig, = 1 IfO9909] = f(x) |P/ fll,” . 


with integral fll, 

To prove this when "p=", q = 1, 
(X.) of h-measurable sets i.e. such that X(X : 
As f is h-measurable there are sets Yi with union E~ such that 
m-1 < f(x) < m for x € Yp» and such that x(Y,, ;.)h and FX(Y,, 3.)h are 


we assume h VBG* using a sequence 
3.)h is integrable over E. 
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integrable over E for each m. Given e > 0, the exact set X where 

[f(x)| > |[fll,- £, has 0 < V(h3E3X) < © with X(X;.) h-measurable. Using 
the integrability of fgh when f,g are h-measurable and suitable Riemann 
sums are bounded, we see that for Zin = X; A YaN X(Zjm:*)h and 
[FIX 3+)h are Saco As se union of he Zin is EC n X we choose 
j.m so “thet Z = Z.p has 0 < V(h3E;Z) < =, and put 


g(x) = nae XZ) 5 Illy = 15 f [falah = 


IV 


I, If] X(Zs.)dh/V(h3E3Z) 2 |IFII_- 


This proves Theorem 12.4 when "p = œ, q = 1, 

Returning to the general Young functions, we cannot always prove 
Theorem 12.2 (12.9) for constants agbo: For example, this cannot be 
done explicitly with the functions of Ex. 2.6. tiowever: using Theorem 
12.4's result as a definition we can proceed. Let Ls be the set of all 
real- “eats h-measurable point functions f with 6( fh) integrable over E. 
Then l is not always linear. For in one dimension let E = [0,1) and 
h=m -A m([u,v),x) = v-u. If ¥(y) = e71 (eY-1) (Ex. 2.6) and 
f(x) = log (x74) then ¥(|f(x)|) = ea! (x73 - 1), integrable with respect 
tom. But 2.1og (x74) does not lie in the set as ¥(|2f(x)|) = el (xt), 
not integrable over [0,1) with respect to m. aes 

To obtain linear sets of functions we define Lo to be the set of 
h-measurable point functions f for which lf ll, is BONS where for all 
h-measurable g with 


| vlg)dh s 1, Iflg = sup | | fg|dk, 
E E 


the definition following the result of Theorem 12.4. 


Theorem 12.5 (12.19: For the usual k 2 0, the space Lọ ts linear, and if 


f,.f, are in Las 


It, + fallas If lla + ifla 
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* . * = 
(12.20) FFE L then lf ll, g i; ol |f|)dh + 1 and La contains L . v(p.) < V(h;E;x,) 1 g(x) = p5X(K55X) | ¥( Jg; |)dh 
E 


(12.21) In the definiti f | fadh o tase | fg|dh. 
n the definition of || Ile p gdh can replace z [fg] Wp; )x(X53x)dh = ¥(p5)V(hsEsX 5) s 1, fll, 


LL 
oO 
v 


(12.22) Let h 2 0 with variation not identically 0, be VBG* using a 


i 
oO 


sequence (x) of h-measurable sets, and let » be not identically a f. |fg;|dh i i [FIP XQ ssx)dh, If X(X43.) i 


0. Then IF ll g= 0 if and only if f = 0 h-almost everywhere. h-almost everywhere. Hence f = 0 h-almost everywhere in X} CF a EA 


and so h-almost everywhere in pË, the union of the Xj. The converse is 


Proof: The linearity is clear, while if Fy sfo are in Lo and easy. 


l KE adjam ty Sian | (Fy s#p)aleh 3 J lFyalan $ f ifzalan < lifi Ilg Theorem 12.6: (12.23) If h20, f€ Lg, but ||fllaž 0, then f/ ||fllẹ ties 
in L? 

a3 
iá folly - 


Hence (12.19). As f.g are h-measurable, so are o(|#|), Wal), and we (12.24) If also there is a constant H > 0 such that (2x) < Mo(x) 


can consider the integrals below. Using Young's inequality, Theorem 
2.9 (222); 


(x 2 Xg 2 0), then La = h : 


(12.25) (H8lder's inequality for Orlicz spaces). If f € Los gE Ly» 


i |fgļdh s i a(|f|)dh + I. ¥(|g| )dh. and f,g are h-measurable, then fgh and |fg|h are integrable and 
In the definition of IIfll, 
that if the second integral is finite, f € , then f € Ls with (12.20). 
In the first integral of (12.21), replacing g by g.sgn”'(g) we have the 
second integral. When ||f||,= 0 with ọ and the variation of h not 
identically 0, and h VBG* using h-measurable sets (X5)5 we amalgamate 
every Xj of h-variation 0 with the first X. of the sequence with positive 
h-variation, and so assume that 0 < V(h3E3X.) <œ (j = 1,2,...). As o is 
not identically 0, there are points r > 0, s > 0 with 


the last integral is not greater than 1, so | fgdh| < | lfgidh < |lfll, [Igll 
s í ò y: 
E E 


Proof: First we prove Ex. 2.4, to have 


IV 
=< 
v 
< 
IV 
Oo 
~ 


(12.26)  ¥(qv) 2 q¥(v) (q 
Being obvious for q = 1 or v = 0, we assume that q > 1, v > 0 


v(xddm 2 (q-1)v(v) = (q-1) | v(v)dm z 


¥(qv) - ¥(v) = 
[0,v) 


s = o(r), W(s) < r, (v) <r(v < s), ¥(v) < rv (vss), lim ¥(v) = 0. Teed 


v>0+ 
Of course the last result follows from (5.44) as soon as w is finite, but 
we need to prove y finite first, and the continuity is then a short step. 
Thus, for each positive integer j there are a number Pj > 0 and a function 


(q-1) | w(x)dm = (q-1)¥(v). 
[0,v) 


f 
If £ € Lgs plg) = f YClgl)dh s 1, then | |faldh < |lFll, » 
gj on E~ such that E E 
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from the definition of Ifill: If 1 < plg) < œ, (12.26) gives o(Up Iifll, ) and is h-measurable, and so is integrable with respect to h. 
The integrability of o(|f,|)h is proved as for ug = 0, and o(|f|) = 


¥(|g|) < p(g)¥(|g|/olg)), olf l) +l f|), giving (12.24), 
For (12.25), as f,g are h-measurable, fgh and |fg|h are integrable. 
| ¥(|g|/p(g))dh < | ¥(|g|)dh/o(g) = 1, For if llalliy 0, g = 0 h-almost everywhere and the result is trivial. 
È E Otherwise, replacing f, è by g, ¥, in the proof of (12.23), 


p(g/ |Ig||, ) $ 1, so that by definition 
f; Ifo/ololdh s Iflg» |, Ifgldh < Iflg o0), ! 


falan = f [#(9/ lglly) dh. lolly = itlly lolly- 
E EE fo Iolar = f lly y < ifile lially 


In the statement (12.24) we can replace 2 by any number N > 1. Even 
I, |fg|dh s IIfllg o*(g), p*(g) = max (o(g),1). so, the condition on © is very restrictive, and is not satisfied by the 
Y of Ex. 2.6. 
At the beginning of the section, Theorem 12.1 proves the completeness 


of LP spaces for al] finite p > 0. We can now extend this to Lo spaces. 


As f is h-measurable, so are |f|/ fll, and o(|f|/ fll). Hence the 
integral of the latter exists if and only if suitable Riemann sums are 
bounded. If g = o(|f|/ fll.) then g is h-measurable and we have 
equality in Young's inequality (Theorem 2.9 (2.22), (2.23)) at each 


Theorem 12.7: Let h 2 0 be VBG* using a sequence (X.) of mutually 
point. As all the Riemann sums involved have non-negative terms, the J 


disjoint h-measurable sets. Then Lo ts complete, t.e. tf fa € Lo 


following integrals exist and we have (j 2 1 sty0c0) and Tim If, - fll 0, there is an f € Lẹ» uniquely 


r,s > @ 
o*(g) 2 i | (/ If ll, )gldh a I. a(|F]/ II ||, dh + olg). determined h-almost everywhere, such that me Ifa - fll, = 0h 
If polg) = p*(g) then | a(|F|/ IIf ll, )dh 0. nae ni € > 0, there is an N = N(e) such that for all g with 
E 0< plg) S15 


If plg) < (g) then p*(g) = 1, | oC |FI/ [lfll Jan < 1. 
E (12.27) | If, - f l-lgldh < e (r,s 2 N(e)). 
E . 
Thus in either case (12.23) is true. 
For (12.24) we have L> c Lẹ. If f € Ly and ||f|] #0 then 
f/ IfI E Lae For some p, || fll, <2P. Hence f€ LŽ when ug = 0 since 


If » = 0 everywhere, so is 4, Lo contains all h-measurable functions, 
and we have only to prove that the limit function of a sequence of 
h-measurable functions is h-measurable. See Section 16. So we proceed 
a(|f]) < MPo(|f|/2P) < MP a(|F|/ lIfll,)- | a(|f|)dh < as in the proof of (12.22), obtaining ¥(x) + 0 (x + 0+) and assuming 

that 0 < V(h;E;X;) <œ (j= 1,2,...). Thus there is a constant P} > 0 
MP. such that 


IA 


MP f. o(|F|/ IFI] Jah 


Ex T1 s 3 x ete 
If ug > 0 we write f = f} + f, where f, = f on the set in E? where ¥(p5) sV(hsEsX5) . If g; = p5x(X53.), k Y(]g;])dh = ¥(p5 )V(hsEsX5)s1. 


lf / |Ifll, < Ug» and f} = 0 elsewhere. Then 4(|f,|) is bounded by 
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Using Theorem 12.1 on (12.27) with g = gj» there are a point function f 
depending on j and a subsequence (rs(s)) of the positive integers such 
that 


(12.28) ou cr = f(x) 

h-almost everywhere in Kj (g; being zero elsewhere). By the proof of 
Theorem 12.1 we can arrange that for j = 2,3,..., (r,(s)) is a sub- 
sequence of (r5_4(s)), so that for r) = rj (3), rọ is in (r5(s)) for 
each k > j, and so, h-almost everywhere in EE, 


lim fp (x) = f(x), 

dem 9 
where by the usual mathematical economy of symbols, f(x) here is the 
f(x) in (12.28) when x € Xj. Applying Fatou's lemma to (12.27) with 
S = rj» uniformly in g with p(g) < 1, 


|, lf, - f|, |gldh < e (r 2 N(e)), Ilf, - Ellar 0Oasr>+o, 


Again by Fatou's lemma, we have uniqueness as usual, and hence the 
theorem. Ex. 12.5 shows the need in the next theorem to have f) € Loo 


g; € L,, with bounded sequences of norms. 


y? 


Theorem 12.8: Let h 2 0 on E° with (Fj) c Ly» (9j) E Lys and let f, g 


J 
be the respective limit of (f.), (g.) using ġọ, Y, respectively (i.e. 
lf} = flle > 0, |lg - I5lly +0). Also, for some constant M > 0 let 
lf; lls <M, IIgs lly SM (j = lysis) Mhen f E€ Lo? ge Lys and fagh 


and fgh are integrable and 


f.g.dh > | fgdh. 
f Jag E 

Proof: As in the proof of Theorem 12.1, a subsequence (j(r)) of the 
positive integers satisfies fi > f(x) h-almost everywhere. Then 
(gjg) is convergent using ¥, and so for a subsequence (k(r)) of 
(j(r)), Ok (r) (x) > g(x) h-almost everywhere. Thus also fetr) > f(x) 


120 


h-almost everywhere. Thus if g is an h-measurable function with o(g) < 1, 
and using Fatou's lemma, 


"AE She Sa f z 
le fgdh = | lim inf Fecpygah < lim inf fe Fropygeh = 


E r> Y > œ 


Lim inf |lFycry lly $" 


Y +0 
Hence f € L.. Similarly g € Ly on interchanging the roles of f, f} and 
I» gje Hence p(g/ Sly? < 1 and so f(g/ IS lly 2h is integrable. 
Similarly F 59h, fjgh, Fgh are all integrable for each j. Finally, 
using Theorem 12.6 (12.25), Hölder's inequality here, 


iF f gjah - f fgdh| = f ((f;-Flg + PCgj-9) + 


(f5-F)(g5-9)} dh] < [Fj-FIl, [Sly 
+ [FIL gl9s-Slly* Nafl gll9s-Slly 78 28 5 > = 


Note that if f € La then by Theorem 12.5 (12.19), 


Ifl $ ly fly Aii 


and the sequence ( Ifill? is bounded, and similarly for gj. Thus the 
extra condition is necessary as well as being sufficient. 


Ex. 12.1. A sequence (f,(x)) can converge in mean without being point- 
wise convergent anywhere in E°. For in one dimension put f.(x) = 0 for 
0 <x <1, except for two sloping sides of a triangle, the third side 
being on the x-axis. As j + œ let the area of the triangle tend to 0 
while the triangle moves repeatedly from 0 to 1 with height tending to 
infinity. 


Ex. 12.2. If a sequence (#5(x)) is pointwise convergent and convergent 
in mean, then the limits are the same h-almost everywhere. (Use Fatou's 


lemma. ) 
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Ex. 12.3. For one dimension with h(I,x) the length m of I, let a; be 


caus ee take J 
real with Z a, convergent. Then f.(x) = z a_ sin sx is convergent 
jet J J s=1 5 
in mean with index p = 2, to a function f(x) over ES = [-n.0]. For if 
E = [-1,7), 


, , 0 (s # t) 2 
| sin sx sin tx dm = { and | (f,-f ) dm = 
E m (s = t) E š 
eo o 
T 2 adi sce Y) 
j=s+1 J 
Ex. 12.4. Show that the sequences (s(x), (t5(x)) converge in mean 
. . J = 
with index 2 over E = [-,7) where s(x) = E u 1 (log u) sin ux, 
i 22 
j È u 
t(x) = 2 (ulog u) Y in ux. If f,g are the respective functions to 
u= 


which Sj» t are convergent in mean, find 
| f(x)g(x)dm 
E 
(New University of Ulster, 1977, M 313) 
Ex. 12,5. Let (fF) be convergent in mean to f with index p and let g 


be any h-measurable function. Then (F) + g) is convergent in mean to 
f + g with index p. 
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CHAPTER 4 


LIMIT THEOREMS FOR MORE GENERAL CONVERGENCE, WITH CONTINUITY 


13 Basic Theorems 


We can replace the integer variable j of Chapter 3 by a variable y 
that takes more than a sequence of values, and lies in a directed set Y, 
and then we often run into difficulties. Some of the difficulties can 
be seen when y is a real variable in [0,%) and we take the limit as 
y >œ. Putting y = (wea)! and aw, we can deal with limits w > a+ 
and w > a-, respectively, and so can consider the continuity of an 
integral with respect to a parameter w at the point w =a. At this 
stage it is thus enough to give the tests of Sections 8, 9, 10, replacing 
j +o by y >œ, and a general Y replacing j > œ in Section 11. 


Theorem 13.1: For E an elementary set, k(1,x) > 0 in EC, and f(x,y) 
real-valued and monotone increasing in y for y in [0,0), let 

f(x,y)k(I,x) be integrable over E to H(E,y). If H(E,y) is bounded above 
for y 2 0, with supremum H(E), then f(x,y) tends to a limit f(x) as y > œ, 
except in a set X of x with V(k3E3X) = 0 (i.e. the limit exists k-almost 
everywhere) and f(x)X(X3x)k(1,x) is integrable over E to H(E). 


Proof: Restricting y to the integers j, by Theorems 8.1, 8.2, f(x,j) 
tends to a limit f(x) as j + ©, except in such a set X, and 
f(x)x(X3x)k(1,x) is integrable over E, to the supremum of H(E,j) in j, 
which is H(E) by monotonicity in y. Similarly f(x,y) + f(x) as y + œ, 
except in X, and lim H(E,y) exists equal to H(E). 

y>% 
Theorem 13.2: For E an elementary set, k(1,x) > 0 on BA and f(x,y) 
real-valued for x € E°, let f(x,y)k(1,x) be integrable to H(E,y) over E 
for each y > 0. If there are a real number B and a positive function § 
on E? such that for all §-fine divisions D of E and all choices of 


functions y(x), 


(13.1) (D) = F(xsy(x))k(1,x) 2 B; 
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tf lim f(x,y) exists equal to f(x), finite, k-almost everywhere, and if 
->00 


lim inf H(E,y) is finite then fk is integrable over E and 


y >œ 


(13.2) | fdk < lim inf | f(x,y)dk = lim inf H(E,y). 
E E 


y>% y >œ 


If (13.1) is replaced by 
(13.3) (D) zf(x,y(x))k(I,x) < B 


for the same 6, D, y(x), with f(x,y) tending to a finite limit f(x) k- 
almost everywhere and 1im sup H(E,y) finite, then fk is integrable over 


y >œ 


E and 


(13.4) l fdk > lim sup | f(x,y)dk = lim sup H(E,y). 
E E 


y >œ y>% 


If (13.1), (13.3) are replaced by 
(13.5) B < (D) x f(x,y(x))k(I,x) <€ 


for some real numbers B < C and for all choices of functions y(x), and 
if the other conditions hold, then fk is integrable over E to H(E) = 
lim H(E,y), a finite limit, i.e. 


Yo 


(13.6) | lim f(x,y)dk = Tim | f(x.y)dk. 


Proof: There is a sequence (y,) of y tending to + œ, such that 


lim | f(x,y.)dk = lim inf | f(x,y)dk. 
joo JE J y > œ E 

Applying Theorem 9.1 (9.2) to f(x) = F(xsy5)s we have (13.2) since, 
k-almost everywhere, 


lim inf F(x) = lim f(x,y). 


j> œ Yoo 
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Similarly for (13.4), and then (13.6) follows. 
If in the proof of (13.2) we drop the condition that lim f(x,y) 


y > œ 
exists (finite) k-almost everywhere, then Theorem 9.1 (9.2) gives the 


existence of the integral of lim inf f(x,y 5) which can be put in place 
j >o 


of the first integral in (13.2). As this lower limit is not less than 


lim inf f(x,y), we can multiply by k and integrate it over E, to 
y >œ 


replace the first integral in (13.2), provided that the integral exists. 
But Ex. 13.1 shows that we cannot take the integral's existence for 


granted. By Theorem 13.1 the integral does exist if inf f(x,y).k is 
Osysm 


integrable over E for each m > 0. A similar discussion can be made of 
(13.4), replacing every inf by sup, while (13.6) need not be altered. 

Similar changes can be made of the majorized (dominated) convergence 
test, 


Theorem 13.3: Let y be a real-valued parameter, let E be an elementary 


set, Let k(I,x) 2 0, and let f(x,y), p(x), q(x) be point functions, all 
defined on EC, Let f(.sy)ks pk, qk all be integrable in E. 
If ps f(.sy) for all y, with finite 


(13.7) lim inf l f(.,y)dk, and inf f(x,y).k (alla <m) 
y >œ E asysm 


integrable, then lim inf f(.,y)k is integrable over E and 


yro 


A 


(13.8) f lim inf f(.,y)dk < lim inf l f(..y)dk. 
E yso y> œ E 


On the other hand, if f(.,y) < q for all y, with finite 


(13.9) lim sup | f(.,y)dk, and sup f(x,y).k (all a < m) 
y>% E asysm 


integrable, then lim sup f(.,y)k is integrable over E and 


y> œ 


125 


(13.10) Tim sup | f(.,y)dk < | lim sup f(.,y)dk. | giving (13.14), (13.15). 
y> E E y >œ 
Lemma 13.5: Assuming (13.12) with 
(13.11) If f(x) = lim f(x,y) exists (finite) k-almost everywhere, and if 
y> œ 
13.18) the Fly;[a, 3 L L t sb] t 
p< f(..y) <q (all y), then fk is integrable and (13.6) holds. ( a PIDA concealing sae panning oak Tap a 


Here, (13.7), (13.9) are not needed. F(La.x))(a < x < b), 


: ý i ; z 
roota. Wee Trent ee. (13.19) the Fly3.) are AC*(X) uniformly in y where X is a closed subset 
, A : of [a,b], 
We now extend the theorems on controlled convergence in one dimension. 
«2 hen V(F(y;.) - Flus.)3 [a,b);X 0, V(F-F(u;.)3La,b)3X) + 0 
Lemma 13.4: (13.12) For each y > 0 let f(.,y) be a real or complex (13.20) than W(Flys.) (u;.); [a,b);X) + (F-F(u3.)3Lasb)3X) 
valued point function on [a,b] that converges almost everywhere in (y < Us y > œ). 
[a,b] to a finite f as y + œ, f(.sy)m being integrable to F(y;I) on all 
intervals I c [a,b). Theorem 13.6: If (13.12) and (13.18) hold and 
| 


(13.13) Let Flys.) be AC* in [a,b) uniformly in y, so that the F(y;.) i (13.21) the functions F(y;.) are ACG* uniformly in y, 


are absolute integrals. 


(13.22) then fm is integrable on [a,b) and 


(13.14) Then \(F(y;.) - F(u;.); [a,b)) > 0 as y,u > œ. 
| #(.sy)an > | fdm (y > œ). 
[a,b) [a,b) 
The proofs of Lemma 13.5 and Theorem 13.6 parallel the proofs of Lemma 
10.2 and Theorem 10.3. 
Next we examine the necessary and sufficient conditions, using a 
directed set Y of parameters y, and the limit process ni of Y. 


(13.15) Further, if Fly;.) > F as y > œ then \(F-F(u;.); [a,b)) > 0 


as U > œ. 


Proof: As in the proof of Lemma 10.1 we can assume that f(.,y) con- 
verges everywhere. For integers p and given e > 0, let Yo be the set 


of x where 
Theorem 13.7: (13.23) Let E be an elementary set, let the real-valued 


function k(1,x) of bounded variation be defined in Ee, and for each y E Y 


let f(.,y) :E° > R with f(.,y)k integrable in E, such that lim f(x,y) = 
Y 


Then we again prove that, uniformly in y, f(x) for each fixed x E Ec. A necessary and sufficient condition for 


the integrability of fk over E, ts that 


(13.16) | #(xsy) - #(x)| < e (all y 2 p). 


(13.17) V(F(ys.)sfasb) ss¥,) < elp 2Pple))s V(F(y;.)-F(u;.);[a,b)) <2e(b-a+1) 


(y MER k (13.24) (D) 2; f(x,y(x))k(1,x) € S 
= 0 > 
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for some compact set S of arbitrarily small diameter, a positive funettion 
6 on ES, a funetion MiE° > Y, all functions y(x) sE° + Y satisfying 
y(x) 2 M(x) everywhere on E? in the direction of Y, and all §-fine 


divistons D of E. 


Theorem 13.8: Given (13.23) and the integrability of fk over E, a 


necessary and suffictent condition for 


Vin | f(.,y)dk = l lim f(.,y)dk 
Y E E Y 


ts that for some compact set T of arbitrarily small diameter with S N T 
not empty, and a J E€ Y, a positive funetion §(.,y) on ES for each ye Y 
with y 2 J, and all §(.,y)-fine divisions D of E, 


(13.25) (D0) x f(x.y)k(I,x) €T (y 2 J) 


See the proofs of Theorems 11.1, 11.2. 
These results are specialized to differentiation in Section 15, and 
to integration in Section 18. 


Ex. 13.1. In one dimension let X be a set in [0,1) for which its 
Characteristic function or indicator x(X;x) is not integrable over [0,1) 
with respect to m(I,x), defined as the length of I. ForO<x< 1, 
gye 1, and q = 1,2,..., let 


0 (x=yeX) 
f(x,y) = { f(x,y + q) = f(x,y). 
1 (otherwise) 


Then f is bounded in [0,1] x [0,+) and is integrable with respect to 
m in (0,1) for each fixed y, while 


lim inf f(x,y) = inf f(x,y) = 1 - x(X3x) 
y>% Osy<1 


is not integrable. 
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A construction of such an X, using the multiplicative axiom, or 
Zorn's lemma, of set theory, is as follows. Let Z(x) be the set of all 
z in 0 < z < 1 with z-x rational. Let X be a set in [0,1) such that 
for each x in [0,1), X n Z(x) contains exactly one point. It is the 
existence of X that needs the multiplicative axiom. If (r) is the 
sequence of mutually distinct rationals in [0,1) let X. be the set of 
all x + rie xX + rj - 1 that lie in [0,1), with x € X. Then Xj splits 
up into two parts, each congruent to a part in a corresponding split 
of X. Hence if the second integral exists, 


| xX(x.3x)dm = | x(X3x)dm, k | x(X3x)dm = 
(0,1) 4 [0,1) me: 

T ) (X3x) 

u X.;x)d li X;x)dm = 0, 
E S ETR lnt ney 
| x( U X.3x)dm = 0 
[ts got A 


e.g. use the monotone convergence theorem. But U k = [0,1), giving 
j=1 
a contradiction. Hence x(X;x)m cannot be integrated over [0,1). 


14 Fatou's Lemma And The Avoidance Of Nonmeasurable Functions 


This section deals with the problem of obtaining Fatou-type results 
when sequences of functions depending on an integer parameter j, are 
replaced by functions depending on a parameter y lying in a uncountable 


directed set Y having a limit process lim. We have to avoid non- 
yey 


measurable functions such as turn up in Ex. 13.1. 
Theorem 14.1: (14.1) Let E be an elementary set, let h(I,x) > 0 in ES, 
and for each y € Y let f(.sy) : E° > R be such that f(..y)Jh is integrable 


over E. 


(14.2) If for some u E Y there is a compact set S depending on u, such 
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that for each sequence (y,) c Y with every Yj 2 Us some positive function 
§ on E° and depending on (y.) and u, each §-finé division D of E, and 


each integer-valued function j(x) on Ee. 


(D) E f(x. 5(y))A(I5x) E€ S, then g(x, (y;)) = xy F(xsy5) 
exists (finite) h-almost everywhere and ts integrable over E, say to 
G((y;)). As (y5) varies in Y with every Yj > u, then G((y,)) has an 
infimum, say J(u), there is a (z,) c Y with every z, 2 u and G((z5)) = 
J(u), and there is a function k(u;.) : E° + R, independent of (y;) and 
integrable over E to J(u), such that g(x,(y.)) 2 k(u,x) h-almost every- 
where, with g(x, (y;)) = k(u;x) h-almost everywhere when G((y,)) = J(u). 
Conversely, tf the G((y,)) extst and have an infimum, then (14.2) holds. 
Further, 


(14.3) J(u) < inf | f(x,y)dh. 
y2u ʻE 


We write k(u3x) as inf* f(x,y), h-measurable. By the construction, 
y2u 
h-almost everywhere, 


k(u3x) 2 inf f(x,y), 
y2u 


but the last lower limit need not be h-measurable, see Ex. 13.1. 
Proof: From (14.2) for each positive integer m, 


(14.4) (D) 2 min f(x,y.)h(I,x) € S, and min f(x,y.)h(I,x) 

Isjsm 3 1<jsm J 
is integrable by Theorem 6.3. Theorem 8.2 (strong monotone convergence) 
with (14.4) gives the integrability of g(x, (y;))h to a value in S. 
Thus elly;)) has an infimum J(u) in S, and so finite. Hence for each 
q = 1,2,... there is a sequence lyg? with G((y,5)) < J(u) + 1/q. For 
(z,) the sequence Y442¥ 409% 242¥ 139° 099V 34 oY yqorees each (y ;) isa 


subsequence, G((z;)) < G((y,5)) du) + 1/q@ (q = Is2ss00ds G((z5)) =J(u). 
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Writing k(usx) = g(x;,(z:)), we show that it is unique modulo values on 
sets of h-variation 0. “For if (w;) satisfies G((w.)) = J(u) let (t) 
be the sequence Z42Wy2ZosWosZgoeee - Then 


G((t;)) s G((z,)) = J(u), G((t,)) = J(u), g(x, (t,)) < g(x (z;)), 


I, (g(x, (25)) - g(x.(t,))dh 


h-almost everywhere; so, similarly, g(x, (t,)) = g(x, (w;)) h-almost 
everywhere, and g(x, (w;)) = g(x,(z.)) = k(u;x) h-almost everywhere. 
Conversely, if J(u) is finite, (14.4) follows from 


(14.5) Itu) s Glyj)) s | Flys)dh ( = 1.25000) 


for some positive function § on E° and depending on (y,) on u, and then 
(14.2). Further, taking any particular y € Y in y 2 u as a member of 
(yj); (14.5) gives (14.3). 

For f(.,y) monotone increasing in y, k(u;x) = f(x,u) will suffice. 
But if f(.,y) is monotone decreasing in y we need the full construction 
except when Y has a linear order so that in each finite subset of Y 
there is a greatest y. This has relevance in the proof of the first 
part of the next theorem, as k(u;x) is monotone increasing in each 
sequence of u, h-almost everywhere. We need the limit as u rises in Y, 


obtained using sup*. 
uEY 


Theorem 14.2: (14.6) Let the conditions of Theorem 14.1 hold for all 
u € Y. Writing sup* k(u;x) as lim inf* f(x,y), we have a Fatou-type 


ucY yey 
result, 


(14.7) | lim inf* €(x,y)dh < Tim inf | #(x,y)dh 
E yey u€Y y>u JE 


in the sense that if the right side is finite then the left stde exists 
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with the inequality. Similarly, 


(14.8) | lim sup* f(x,y)dh 2 lim sup | f(x,y)dh, 
E yey ueY y2u / E 


(14.9) and if T inf* f(x,y) = lim sup* f(x,y) h-almost everywhere 
y yY ? 


and written as lim* f(x,y), 
yey 


(14.10) | lim* #(x,y)dh = lim | f(x,y)dh. 
E yey yeY JE 


Naturally, sup* r(x,u) = -inf* (-r(x,u)), lim sup r(x,y) = 
uEY uEY yey 


-lim inf* (-r(x,y)). 
yey 


The proof of this theorem is now straightforward. 
Ex. 14.1. Let Y be the collection of all finite sets y of real numbers. 


For y,u € Y let y < u have the meaning y cu, and let 


0 
f(x,y) = sup g(x,z), g(x,z) = { ae 


Y 
= 1 (x = 2) 


Then f = 0 = g except at a finite number of points, and so for 
m(Cu,v),x) = v-u, 


lim* f(x,y) = 0 
y Y 
almost everywhere. But 


lim f(x,y) = sup f(x,y) = 1 
yey yey 


everywhere. 
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CHAPTER 5 
DIFFERENTIATION, MEASURABILITY AND INNER VARIATION 


15. Differentiation Of Integrals 


First we consider the differentiation of the integral with respect to 


a parameter, 


- | lim Fayth) = fixy) ak = atx) ak, 
E hoo p % 


The case in one dimension of k = m where m([u,v),x) = v-u, aroused great 
interest early in the history of the calculus as it was realised that if 
the process could be legitimised, many difficult integrals could be 
computed. As a result, C. Jordan in 1891 or earlier, requested the 
following, quoted by C. de la Vallée Poussin (1892). 
'Give a rigorous theory of differentiation under the integral sign 
of definite integrals, with precise conditions which limit Leibnitz's 
rule, principally for unbounded regions of integration or unbounded 
functions, and particularly many celebrated definite integrals.’ 


After a century, we can give necessary and sufficient conditions for 
which (15.1) holds. Here, h is real, so that Section 13 can be used and 
we can apply various tests in particular cases, even when the elementary 
set E is changed to intervals [a,to), (-«,b), (-«,+0), or infinite volumed 
bricks in higher dimensions, provided that we use the integral of Section 
5 after Theorem 5.14, or the analogue in higher dimensions. The tests 
assume h > 0. Similar tests hold when h < 0. For notation we use 


R(#3x,ysh) = {f(x,y+h)-f(x,y)}/h, R(H;E,y,h) = {H(E,y+h)-H(E,y) }/h. 


Theorem 15.1: (15.2) Let E be an elementary set, k(I,x) 2 0 in EC; and 
f(x,z) real-valued for x € EE, with f(-z)k integrable over E to H({E,z), 
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for each z in a neighbourhood of y, so that R(f3x,y,h)k is integrable over If (15.4), (15.8) hold for some real numbers A < B and all choices of 


E to R(H;E,y,h) (h > 0). positive functions h, with (15.5), then (15.6), (15.9) hold with (15.1) 
for right-hand derivatives. 
(15.3) If R(f;x,y,h) čs monotone in h > 0 and the monotone R(H;E,y,h) is Next, Theorem 13.3 gives a majorized (dominated) convergence test. 


bounded as h + 0+, then R(f3x,y,h) tends to a finite Limit k-almost 
Theorem 15.3: Assuming (15.2), let p,q be real-valued funetions with 


p<qon ES and pk, qk integrable on E. Then (15.7) follows from (15.5), 
ts true on the right at y. (15.6), and 


everywhere as h + 0+, that we can write as (af/ay)y4s and (15.1) 


Instead of monotonicity we can use bounded Riemann sums. 
(15.11) R(#3x,ysh) = p(x) (x € E°) 


Theorem 15.2: Let (15.2) hold. If there are a real number A and a positive 
funetton § on ES such that for all §-fine divistons D of E and all positive while (15.10) follows from (15.5), (15.9), and 
funettons h on Er, 
(15.12) R(fsx,ysh) < q(x) (x € E°). 


(15.4) (D) ER(f;x,y,h(x))k(I,x) 2 A, 
If (15.2), (15.5), (15.11), (15.12) hold then (15.1) follows for right- 


hand derivatives. 


exists (finite) k-almost everywhere, 
The corresponding theorem for controlled convergence in one dimension 


+ 


(15.5) if lim R(f;x,y,h) = (əf/əy) 
h>0+ y 
is as follows. 
(15.6) and if lim inf R(H;E,y,h) ts finite, i 
h>0+ 
Theorem 15.4: A J (15.2) with E = [a,b), let 
then (3f/3y) k is integrable over E and iaiia fa aii 


R(f;x,y,h)dk = lim inf R(H;E,y,h). 


| (15.13)  R(H;[a,x),y,h) converge to (aH(La.x) y)/ay),,, as h + O+, 
E h>0+ uniformly for x in (a,b]. 


(15.7) Í (3f/3y) pdk < tim int 
E y h>O+ 


If with (15.5) and the same 6, D, h(x), (15.4) is replaced by (15.14) Zet the R(HsLa,x),y sh) be ACG* uniformly in h > 0. 


(15.8) (D) ZR(f3x,y h(x) )k(I,x) < B, Then (15.1) ts true for right-hand derivatives. 


Next follow necessary and sufficient conditions for (15.1) with right- 


(15.9) with lim sup R(H3E,y,h) finite, 
ho0+ p velit gma hand derivatives. 


then again (Əf/ðy) k ts integrable in E, with 
g y ys ens ce are Theorem 15.5: Assuming (15.2), (15.5), a necessary and sufficient 
A sondttion for the integrability of (3f/əy)., k over E, ts that for some 
(15.10) l (af/ay), dk 2 lim sup | R(f;x,y,h)dk = lim sup R(H;E,y,h). condi f nini f YW) 4 ce 
E y h>0+ E h>0+ compact set S of arbitrarily small diameter, two posttive functions 4, M 


aves c 
on ES, all positive functions h on EC satisfying h s M everywhere on E~, 
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and all §-fine divisions D of E, 


(15.15) (D) SR(f3x,y,h(x))k(I,x) € S. 


Given this integrability, a necessary and sufficient condition for (15.1) 
with right-hand derivatives, is that for some compact set T of arbitrarily | 
small diameter with SN T not empty, a J > 0, a positive function §(-,h) 

on ES for each h in 0 < h < J, and all 8(-,h)-fine divisions D of E, 


(15.16) (D) ER(fsx,y h)k(I,x) ET (O< hs J). 


The proofs of these theorems follow the proofs of corresponding theorems 
in Section 13, with y changed to h and f(x,y) to R(f;x,y,h). There is a 
similar set of theorems for left-hand derivatives. In order that two- 
sided derivatives occur, we take the limits in (15.5) and the analogue to 
be equal k-almost everywhere. 

In classical theory the differentiation of integrals relative to bricks 
in one dimension is appreciably easier than in n dimensions (n > 1), as in 
the latter we have to avoid needle-like bricks in order that Vitali's 
covering theorem can be used. Our theory gives the size of exceptional 
sets, showing that they are of inner variation zero, and then in special 
cases that they are of variation zero. Let E be an elementary set and ô a 
positive function on the closure E°, A definition from Section 7 is that 


a set Q of brick-point pairs is a 6-fine partial division of E if Q isa 
subset of a 6-fine division of E. A set C of brick-point pairs (I,x) is 
called an inner cover of a set X in ee if for each positive function 6 on 
E and each x € X, there is a 5-fine pair (I,x) € C with I c E. The set 

X has inner k-variation 0 if for each £ > 0 there is an inner cover C of X 
such that for each 6-fine partial division Q of E with Q c C, and k(I,x) 
defined for Ec, 


(15.17) (Q) Z|k(I,x)| < e. 


Sierpinski's lemma when n = 1, and Vitali's theorem when n > 1, are used 
to find in some sense a maximal Q out of C, to prove that the variation of 
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k over X is 0. 

Let p(1,x), q(I.x) be defined for E®, If, for a particular x € E a 
number d exists with the property that, given e > 0, there is a § > 0 such 
that for all -fine (1,x) with I c E, then 


(15.18) |p(I,x) - d.q(I,x)| < e|q(1,x) |. 
If another number d* (4 d) satisfies the condition then 
|d-d*| |q] = |p-d*q + dq-p| < 2 €|q|. 


For e = |d-d*|/4, q = 0 follows. Otherwise, if d exists it is unique, 
and we write d as d(p,q;E;x). It is independent of E if a sphere centre 
x, lies in E. We call d the derivative of p relative to q and E. As the 
set X of points x where q(I,x) = 0 for some positive function 6 on ES and 
all -fine (1,x), clearly has V(q;E;X) = 0, X can be disregarded. 


Theorem 15.6: (15.19) For E an elementary set and f(x), k(1,x) defined 
for E°, let fk be integrable to H on the partial sets of E. Then 
d(H,k;E;x) = f(x) except at points of a set X c EC with inner k-variation 
0. Also the set X, of all x where d(H,k;E;x) exists but is not f(x), has 


k-variation 0. 


(15.20) If (X5) ts a sequence of sets each with inner k-variation 0, the 


unton of the sets also has inner k-variatton 0. 


Proof: There is a positive function e on X such that for each positive 
function § on g and each x € X, 


(15.21) JH(I) - f(x)k(1,x)] 2 e(x)|k(1,x) | 
for at least one §-fine (I,x). Then X is the union of sets Xj on which 


2l-(x) > 1. By Theorem 5.3, for each positive integer j there is a 
positive function 8 on E° such that for each 5,-Fine division ur of È, 
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(15.22) (D.) z|H(I) -= #(x)K(Ix)] < e4, 
Taking ô = 5; in XK with Xo empty, 6 arbitrary in ~X, and D ‘a $-fine 
division of E, then by (15.21), (15.22), 


(D5 )E| KCL 4x) [x(X 55x) s e2, (D)Z|k(1,x)]X(X;x) $ £, 


and X has inner k-variation 0. The proof of (15.20) follows similarly. 

To show that X, has k-variation 0, in (15.21) we can take 
e(x) > |d-f(x)|/2, the inequality being true for all 6-fine (I,x) and some 
positive 6 on E°. Then Theorem 5.7 (5.22) with 2/|d-f(x)| replacing f(x), 
shows that V(k3E3X,) =: 0. 


Theorem 15.7: In Theorem 15.6 let n = 1 with k, fk, |fk| integrable in an 
elementary set E and k 20. If K ts the integral of k then 


lim 


vou+ K([u,v)) biel 


— |f(x)-F(u)|dk = 0 ([u,v) c E) 


except for x in a set of inner k-variation 0. Similarly for v > u-. 


Proof: For any constant c, the integrability of k, fk and |fk|, imply the 


integrability of (f-c)k; and |f-c|k, = max(f-c,c-f)k when f-c is real-valued, 


using Theorem 6.3; or by Ex. 6.4 when f-c is complex-valued. Thus by 
Theorem 15.6 we have 


Lim l 


— | |f(x)-c|dk = |f(u)-c| (Lu,v) c E) 
veut K(Cu,v)) ”[u,v) 


except for u in a set X(c) of inner k-variation 0. Taking c = p.279 + 
ir.275 when f is complex-valued, and +p.2 7 when f is real-valued 

(p.q.r.s = 0,1,2,...), the union X of the countable number of sets X(c) 
for the special c, is also of inner k-variation 0, by Theorem 15.6 (15.20). 
If b is any constant and c is one of the special numbers with 


led we, | F(x)-b[dk = | If(x)-c|dk 


[u,v) 


wal 
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<| Il #(x-b| =| #(x) -clldk 
4Tu,v) 


< |c-b|K(Lu.v)) < eK(Lu,v)). 


* 


For u ex X, some positive function 6* on an and |v-u| < 6* (c), Theorem 


15.6 gives 
| | FO)-el ak < K(Cu.v)) {]f(u)-cl + ©} sk(Lu,v)){]#(u)-b| +2e}, 
[u,v 
l ` |f(x)-bļdk < K(Cu,v)) {[f(u)-b] + 3e}. 
[u,v) 


The arbitrary constant b can depend on u. Taking b = f(u), the theorem 
follows. 


Theorem 15.8: If p(1,x), q(1,x) be defined for E° with q arbitrary and 
p VB* or VBG* in a set X, relative to E, then p(1,x)/q(1,x) (I cE) is 
either of the form 0/0 or is bounded as 8 > 0 for &-fine (1,x) (x € X), 
except for x E X,, X, being of inner q-variation 0. If \(p;E;X) = 0 then 


d(p,q;E;x) = 0 in X, except for a set of inner q-variation 0. 


Proof: By Theorem 15.6 (15.20) it is enough to take p VB*. If fora 
constant M, |p| > M|q| in an inner cover C, and if QcC is a 5-fine 
partial division of E then for small 6, 


M(Q) = Jal < (Q) Z |p| < (0) 2 |p| < V(ps63E3X) < œ. 
Where the ratio is unbounded, M > œ and the inner q-variation is 0. If 
V(p;E;X) = 0 then V(p;68;E;X) is as small as we please, whatever the M > 0, 
and by Theorem 15.6 (15.20) the set of all points with a fixed M > 0, has 
inner q-variation 0. Hence the second part. 
Theorem 15.9: Let p,q,",S be brick-point functions defined for E°, let por 
be variationally equivalent, and let q,S be variationally equivalent, both 


relative to X and E. Then, for x € X, either 


d(p,q;E; x) = d(r,s;E;x) 
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or else neither derivative exists, except for x in a set of inner q- 


variation 0. 


Proof: If d = d(p,q;E;x) exists for a particular x € Xn oe then omitting 
the (I,x), 


[r-ds| <|r-p| + |p-dq| +{d] |q-s|, V(r-p;E;X) = 0 = V(q-s3E3X), 


lA 


Ir-p| s elal, la-s| se {ql 


by Theorem 15.8, except for x in a set of inner q-variation 0. As 
Ip-dq| < elq|, |r-ds| s (e +e +/dle)|q], 


when (I,x) is 6-fine and 6 is small enough, except for x in a set of 

inner q-variation 0, and so of inner s-variation 0, as the exceptional 

set is in X. Hence the result as we can interchange p,q with r,s. 
This could have been used to prove the first part of Theorem 15.6. 
Two geometrical results now follow. 


Theorem 15.10: In one dimension let E be an elementary set, with C an 
inner cover of X formed of (1,x) with I = [x,u) c E for some u > x, and 
let k(I,x) be continuous in E? with finite V(k3E3X). Then, givene > 0, 
there ts a finite number of mtually disjoint intervals from C with union 
Y, such that 


(15.25) V(k3sEsX N Y) > V(k;E;X) - e. 


Let |k| also be integrable on E with the inner k-variation of X as 0. Then 


the k-variation of X is 0. 


Proof: Clearly we can assume that E is an interval L = [a,b) and then add 
up the results at the end. Let X; be the set of x € X for which there is 
an [x,u) from C with u-x > 1/j. Then X = lim X, so that by Theorem 5.11 we 
can choose j so that J3 
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(15.24) V(k;L;X;) > V(k;L;X) - Be. 


WA 


(15.25) Put a, = inf X} 2 a, b} = sup X; 


1 j b, c= by-a,> n = Be/(jct+1). 


By Ex. 5.10, V(k;I) is continuous as the ends of I vary, and if the 
interval J c I, then 


(15.26) V(k;L;X nJ) < V(k3I), 


IA 


the left side being continuous in J, so that there is a point x, € X; 
such that either X, = a, Or, by (15.26), 


(15.27) V(k3L3X Nn [a,>x,)) <n. 


Let [x, 0) € C with Us-X4 > 1/j. If points of X; lie to the right of 
Uy let ay be their infimum. Then either ay = Xo € X; or, by (15.26) and 
the continuity of h, there is an Xo € X} with V(k;L;X n [ay »X»)) <N. 
Let [x2u7) € C with Un-Xp > 1/j. And so on. If in J such steps we are 
at by» then by (15.24), (15.25), (15.27) and similar results, 


J 
[x.,u:), Z= U [assxX.), 
jt 9 z 


J 
c Cc 
Xe Sey Z, Y= 
sO Eas j ar as: 


1 


J 
V(ksL3X)-Be <W(ksL3X5) < W(k;L;X n yj Z V(ksLsCa, x5) 


j=! 
< V(k3sL3X nN Y) + Jn, 


by continuity of k. Also 
(J-1)/j <c, J < cj+1, Un < de, V(kKsL3X) < V(k3L3X n Y) + €, 


proving (15.23). Now let |k| also be integrable in E, so that the 
integral is V(k;E) by the Theorem 5.4 (5.13). If the inner k-variation 
is 0 let the inner cover C give (15.17), the members being 6-fine for 
some positive function § on E° that satisfies Theorem 5.3 (5.6) with h 
replaced by |k|, H by V(k;.), and 8e by e. Then by continuity of k, and 
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(15.23) , 


V(k3L3X n Y) = V(k;Y;X) < V(k;Y) = V(k;[x; 94 )) 


J 


w m a 


j=l 


J 
SS [kK([x. su 
=| J A 


¡x)| +e < 2e V(k3L3X) < 3e. 


As e > 0 is arbitrary, V(k3L3;X) = 0. 

The inner covers considered in Theorem 15.10 could be called right- 
hand inner covers. As we are in one dimension every inner cover of a 
set X can be split up into two inner covers, a right-hand inner cover 
C, of a set x, c X, and a left-hand inner cover Cy of a set Xo c X where 
X UX, =X and the members of C, are of the form (Lu,x),x) for some 
u < x. The proof of Theorem 15.10 also applies to left-hand inner 
covers, with obvious changes, so that if X is of inner k-variation 0 
then 


0 < V(k3E3X) < V(K3E3X,) + V(K3E3X5) = 0. 


For dimension n > 1 the next covering theorem needs bricks that 
are near to cubes. Let I be the Cartesian product of n intervals 
Ca; bj) (j = 1,...,n) and write 


the product of the lengths of the edges, t.e. the n-dimensional volume. 

This has to be used instead of a more general k(I,x), at least until 

Theorem 15.11 is improved. The coefficient of regularity r(I) =sup u(I)/u(J) 
for all cubes J> I. If d = HA (b;-a,) then r(I) = u(I)d". If there 

is a number a > 0 such that ail tne (I,x) € C have r(I) 2 a, we call C an 
a-inner cover, and this is used to define when a set X has a-tnner 

variation 0. In the definition of a derivative we also insert the 

condition r(I) 2 a. Theorem 15.7 can be given for n > 1 by replacing 
K(Lu,v)) by k(I,u) with u fixed, though to use Theorem 15.11 we have to 

take k = u. 
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Theorem 15.11: For fiwed a > 0 and the set X, let C be an a-inner cover 
of XN ES; and let F be the collection of closures of bricks that occur in 
the elements of C. Then there is a finite or countable sequence (1;°) of 
mutually disjoint closed bricks of F such that 


(15.28) VW(wsEsXNu 1.°) = 0. 
j= 


Proof: We define 1;° by induction. Let 1,” be an arbitrary closed brick 


of F. When the mutually disjoint closed bricks re have been 
defined, if their union U includes all X N EC we need go no further. 
Otherwise there is an x € X n ES that does not lie in U, which is closed. 
Thus there is a sphere S(x,6(x)) that is disjoint from U. By definition 
C contains a 6-fine (I,x), so that there are 1° € F that have no points 
in common with U. Let dj be the supremum of diameters of such 1°, Then 
d, is finite as the original elementary set E is of finite diameter. Let 
1;° be any one of those 1° with diameter greater than 4d.. Then the 
inductive definition proceeds. If we obtain an infinite sequence tr" 


J 
we put 


(15.29) Suppose that \(u;E;Y) > 0. 


As for each j, r(15) 2a > 0, we can associate with each 1,° a closed cube 
K; such that 


Ş : K. A 2 e)a 
(15.30) 1; c Kj» u(1,) au(K;) 
Let Ly be the brick that is a cube with the same centre as Kj but with 
diameter (4n+1) times as great, where of course n is the dimension. The 
following series is convergent since by (15.30), 

w(L,) = (4+1)? Z ulK,) (4n) a) E als) 
1 J j=1 J j= 


(15.31) 


im 8 
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Now W > 0 is finitely additive and continuous. Hence by Theorem 5.6, 
Exx. 5.8, 5.9 and (15.29), there is an integer N such that 


V(u;E; U Lj) 2 So Mises = g VG) 
j=n+1 j=N+1 J  j=N+ J 
= g wll.) < V(u3EsY). 
jeNe1 9 


It follows that there is an x € Y not belonging to any Lj (5 > N). By 
construction of Y, 


= c 
X ENU drain 


os A 


(15.32) Hence there are a sphere S(x454(x)) with no points in common 


wtth the closed 


and a closed brick JÊ € F with J c E and 6, -fine (J,x). 


Now the diameter of J, diam(J) > 0, while (15.31) is a convergent series 
so that u(K5) > 0 and Kj is a cube. Hence we cannot have, for all j, 


diam(J) < d, < 2diam(I 


j jet? < 2diam(K; ,,). 


So, for some j, d; < diam(J). By definition of di J must have points in 
common with 


i iac 
p=N+1 P 


Let s be the smallest j for which Jn 1,° is not empty. Then J is disjoint 
from 1,° for j = 1,2,...,s-1, and 


(15.33) diam(J) s d 4: 
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By (15.32), s > N, so that by definition of x, x ¢ Lo. Thus J contains 
points outside Ls and points of pS = gs But diam(L.) = (4n+1)diam(k ), 


so 
diam(J) > 4n(half edge of K) =2n(edge of K) > 2diam(K_) > 2diam(1, ).d 


contradicting (15.33). Thus from (15.29) we get a contradiction, so 
(15.29) is false and the theorem is true. 


Theorem 15.12: Let X be the union of sets X bo = 1/pr PS Trin) for 
which X, ts of a-inner variation 0, relative to the elementary set È. 


Then \(u;E;X) = 0. 


Proof: By Theorem 5.6 we need only prove the result for a constant œ > 0, 
Given e > 0, there is an o-inner cover C of X such that every partial 


division Q c C of E satisfies 


(Q) fu (1) < e 


Using this with Theorem 5.6, 15.11 (15.28), and Exx. 5.8 (for n > 1), 
5.9, as u > 0 is finitely additive and continuous, 


V(usEsX) s VQusEsXN UT.) + z VQu3E3sX 9 I.) 
j=1 j= 9 


V(u3sE31I.) = 
(u ; a 


Viu3I.) = 
1 j=1 J 


Being true for each e > 0, V(u;E;X) = 0. 

The proof of Theorem 15.11 is so rigidly dependent on the use of u 
that one needs to find what k(I,x) can replace u. For the definition and 
use of inner variation are for quite general k(I,x). Exx. 15.4, 15.5 
are offered as a start of this research, and one can combine the ideas in 


the two examples. 


Ex. 15.1. Let a<b, y 2 0, and consider 
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1 


l 7 dm (m([u,v),x) = v-u). 
[a,b) 1+ye 


f(y) = 


Find f'(y) and hence or otherwise evaluate the integral. 
(New University of Ulster 1974, M212) 
Ex. 15.2. By finding f'(y) show that 


f(y) = [ sec x log,(1 + ycos x)dm = m arc sin y (|y| < 1) 
of) [0 oT) 


(New University of Ulster 1975, M211) 
Ex. 15.3. If in n dimensions 
u(1) (ul) rational) 
0 (y(I) irrational) 


then any brick has inner h-variation 0 while V(h3I) = u(I), h is not 
integrable over any brick, and h is not variationally equivalent to 


V(h;-) in any brick. 


Er. 15.4 Let g:(x) be continuous and strictly increasing in (-~,+©) for 
1 <j <n. Show that in Theorem 15.11, u can be replaced by Ug? the 
product of differences g;(b,) - g;(a;). provided that a iiie change 
is made in the definition of the coefficient of regularity. 


re. 15.5. If 0 < f(x) < h(x) on EF with 


A 


f(x) u(1) < |k(1,x)] < h(x)u(1), 


then a set X of a-inner k-variation 0 is of a-inner variation 0, so giving 
V(u;E;X) = 0 and hence V(k3E3X) = 0. 
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1 (u<0=v) 

0 (otherwise) 
Then k = |k] is integrable over E = [-1,1) while each point x of E has an 
interval [x,v) attached with v > x and k = 0. Taking X = E, V(k;E;X) = 1 
but V(k;E;X N X4) = 0 in Theorem 15.10, showing that (15.23) is false here, 
and that the theorem needs a condition like the continuity of k. 


Ex. 15.6. (J.J. McGrotty) In one dimension let k([u,v),x) = { 


Ex. 15.7. In one dimension let f,g be point functions for I = [u,v) let 
A(f(I) = (v)-f(u). If V(AfAg;E) =0 show that f is continuous except in a set 
X with inner Ag-variation zero. If also g is continuous in E and VB* in X 
relative to E, then V(Ag;E;X) = 0. 


Ex. 15.8. In one dimension we can have v(AfAg;E) = 0 and yet l fdg and 
gdf do not exist. For take g(x) = x(all x) and f(x) = -1(xlog,x) 

(0 < x s 3), (0) = 0, with finite derivatives in (0,3) and AfAg 

continuous at the ends. Thus 


V(afagsL0.3) = 0, | f(x)dg = 10g,109,(1/£) - 109,109, 2 


[e,3) 


which does not tend to a limit as e > 0+. By Theorem 5.12, | g(x) df 


[0,4) 
does not exist either. 


Ex. 15.9. (a) Prove that, for all [u,v) in an elementary set E, 


J... faa = #(vda(v)-F(u)aCu) 
[u,v) 
if and only if V(g*af;E) = 0, where g* denotes that g is evaluated at the 
opposite end of the interval I from the x in (I,x). 

(b) If in (a), Xj is the set of x in EC for which |g(x)| 2 1/j, then 
prove that f is continuous in x. for each j, and the union X of the x," 
(derived sets) has inner Af-variation 0. 


(c) If in (b) f is continuous and VB* in E? prove that V(Af3E3X) = 0. 
(d) If [a,b] € Ks" for some j, prove that f is constant in [a,b]. 
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Hint: f(x){g(x)-g(t)} - {F(x)g(x)-F(t)g(t)} = g(t){F(t)-F(x)}, while 
in (d), X} is everywhere dense in [a,b]. Now see Theorem 3.5. 


Ex. 15.10. Let |k| and [k| x(X);.) be integrable in an elementary set E, 
where X is a set of x, and where we are in one dimension. Ifa<x<b, 
a < b, [a,b) c E, and b-a + 0, then V(k;[a,b);X)/V(k;[a,b)) + x(X3x) 
k-almost everywhere. 

In Lebesgue theory this is called the density theorem, and an x for 
which the ratio tends to 1, is called a point of metrice density 1 of X. 
In higher dimensions we have a similar result using bricks with 
coefficient of regularity 2a replacing [a,b) and with k =u. 


Ex. 15.11. Problem EFG 54 of the Bulletin of the Institute of Mathematics 
and its Applications (vol. 21, July/August 1985, p. 143) is as follows. 
A well-behaved non-negative function f satisfies f(u) + 0 as u > -æ and 


ka ú f(x)Pdm = (PR r #(x)dm)P 


for some constant p > 0, and m([u,v),x) = v-u. Find f. 
If p = 1 we only have the integrability of f. If p # 1 we differentiate 
to obtain, almost everywhere, 


IV 


(a) #(u)? = P( | #(x)ém)°"#(u) (u 20). 

+ (-,u) 
Assuming f(u) # 0, then f = g almost everywhere where (a) is true every- 
where when f is replaced by g. Thus f(u) = A exp(au) where A> 0 is a 
constant and a > 0 satisfies p = aP, 

Clearly trouble occurs when f(u) = 0. Let X, be the set of x where 
f(x) > 0, assuming that f is not 0 almost everywhere. Using Ex. 15.10 
let X be the set of points of X4 that are of metric density 1 of X4: 
Then Xi has variation zero and 


(b) f(u) = x(X;u)a f À f(x)dm 
SN 
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almost everywhere and f = g almost everywhere, where g satisfies (b) 
everywhere and so is AC* and monotone increasing, and, almost everywhere, 


g' (u) = x(X;u)ag(u), d 10g„9(u)/du = ax(X3u). 
Then for some b € X and for all x > b, 
10g„9(x) = aV(m;[b,x);X), g(x) = Aexp(aV(m;[b,x);X)) (x>b, x €X), 


where A is a positive constant. Similarly, for a positive constant Ay; 
we have 


g(x) = A,exp(-aV(m3[x,b) 5X)) (x < b, x € X). 


As b is a point of metric density of X4 and so of X, and by the continuity 
of g on X, we can let x > b to obtain g(b) =A= A,. Thus f satisfies, 
almost everywhere, 


x(X3x)Aexp(aV(m3[b,x)3X)) (x >b) 


F(x) = 
x(X;x)Aexp(-aV(m;[x,b);X)) (x < b) 


for constants A > 0 and b € X, such that f(x) + 0 as x + - œ in the set 
of variation zero where the last formula does not hold. A = 0 covers 
the case where f = 0 almost everywhere. 


16 Limits Of Step Functions 


A function g(x) is a step function in the elementary set E if there 
are a division D, enumerated as (154x5)(3 = 1,2,...,J0) and constants gj 
such that g(x) = 9; in the interior of I, (j= 1,...,d). As the y- 
variation of the boundary of a brick is 0, we see that whatever finite 
values g(x) takes on the boundaries of the I; g(x)y(1) is integrable 


in E with 
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(16.1) | aside = I, |f|du < V(HsE). 
E i= 


de u(I.). 
jet s 


J 


Proof: By Theorem 7.2 we can write H as the difference of two non- 


Theorem 16.1: Let f(x)u(1) be integrable to F(P) over every partial set negative finitely additive brick functions 
F of E. Then f ts the limit almost everywhere of a sequence of step 


functions, the integral of each step function over E being equal to F(E). (16.5) H(d) = 3{V(Hsa) + H(J)} - 3{V(Hsd) - H(J)}. 


i s . ; sie ehh te pn 
Proof: We can assume that E is a brick I, which we divide into 2” equal Thus We can assume HE D. 


s os pn y : : i 
bricks Ij (j = 1,2,3,...,2" ) by continued bisection with respect to each For numbers M > 0, a > 0, suppose that a set X has the property that 


co-ordinate in turn, so that the coefficient of regularity r(15) =r(I)>0. 


ak (16.6) for some a-inner cover C of X n E, H 2 Mu. 
à O 2. pn 
Sp(*) 7 i favuli) Mee ed > Views a2) Given e > 0, there is a positive function 6 on E° such that 
j 
gPa 1 2Pn (16.7)  V(H;8;E;X) < V(H;E;X) + €. 
(16.2) | s (x)du = Z {| fdu/u(1;) pulls) = Z l fau =| fau =F(1). 
p e j=1 VI, VS js JI, I 
J J By Vitali's theorem (Theorem 15.11) there is a finite collection Q of 


By Theorems 15.6, 15.11, F is differentiable to f except in a set X of disjoint bricks with union U, from C, such that 


variation 0, when the coefficients of regularity are bounded below away 


from 0. If x ¢ X and is not on any boundary then MEV(usEsX) - e}< M.V(u;E;X 9 U) < M.V(h;E;U) = M. (Q)Zu(J) 


s0) = F(15)/u(1) * #(x) < (Q)ZH(J) < V(H363E3X) < V(H;E;X) + €, 


as required, and hence the theorem. using (16.6), (16.7). As e > 0+ we have 


A further result, on the differentiation of finitely additive brick 
` functions of bounded variation, follows in a similar way. (16.8)  V(H3E3X) 2 M.V(u;E;X). 

If we can take M arbitrarily large then V(4;E;X) = 0. Also, if (16.6) 
holds for C and if H < Nu for another a-inner cover of X N ES, where M > N, 
then 


Theorem 16.2: Let the real H(I) be finitely additive in the bricks I cE 
and of bounded variation in the elementary set E. Then 


(16.3) f(x) = d(E;u;E;x) exists almost everywhere in E, and 
M. V(u;E;X) < V(H;E;X) < N.V(u;E;X), (M-N)V(usE3X) < 0, 
(16.4) if we put f(x) = 0 in the exceptional set then f and |f| are 


(16.9)  V(u3sE3X) = 0. 
integrable in E with 
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Taking M = (j+1)27P, N = j.2°P (j,p = 0,1,2,...), each set X satisfies 
(16.9). Taking a = 1/q(q = 1,2,...) and taking the countable union of 
all such sets with M,N,a, Theorem 5.6 shows that this union satisfies 
(16.9). In the complement of this union the finite derivative exists, 
proving (16.3). 

We again take each brick I of E and divide I into are equal bricks 
(j = 1,2,3,...,2PP) with s(x) = H(I;)/u(I;) (x € 1,°, J Nsa") 
and by finite additivity, 
2PN apn 
z (CH(T5)/u(15) uC) = 2 H(I;) = H(1). 


i= J 


j s,(x)du = j 


Remembering that H 2 0 and that Sp% > f(x), the derivative, except on 
the boundaries and except in a set of variation 0, Fatou's lemma (say, 
Theorem 9.1 (9.2)) gives the existence of 


| fdu = | lim s (x)du < lim inf | s (x)du = H(I). 
I JI pro P pro JI p 


For arbitrary H, d(H;u;E;x) is a difference fifo of two integrable non- 
negative functions, except in a set of variation 0, while 


WA 
WA 


[lel s f tydu + f odu s AWD) + 2 EVDO) 
sI I aE 


V(H31), 


giving (16.4). 


This section is tightly linked with Theorem 15.11 and so with u. Any 
widening of Theorem 15.11 will result in better theorems here. There are 
wider theorems in one dimension. 


Theorem 16.3: In one dimension let k(I,x) 2 0, defined for EC, be 
continuous there, and let f(x) be defined on E°. Let k and fk be 
respectively integrable to K and F over the partial sets of the elementary 
set E. Then there is a sequence (sp) of step functions such that 


lim s, = f k-almost everywhere in EC and 
p>% 
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Proof: K 20 by Theorem 4.4 (4.5) and is continuous by (5.44). If 
K(La,b)) = 0, then as K is finitely additive (Theorem 5.1), K(Lu.v)) = 0 
for all [u,v) c [a,b), and then 


j fdk = 0 

[u,v) 

by Theorem 5.7, changing k into K. This change also occurs even if K # 0. 
We now follow the proof of Theorem 16.1 using the continuity of K and the 


definition 


= Os p 
$y (x) = F(15)/K(15) (x E I, oJ E Ja 


2P 


J, spook = 


à : CF(1;)/K(1;)}K(15) 


1 
this being a Riemann-Stieltjes integral, and Theorem 15.10 replacing 
Theorem 15.12. 


Theorem 16.4: In one dimension, given an elementary set E let k(I,x) 2 0, 
defined for EC, be continuous there, and let h(1,x) be defined for ae 
such that h, |h|, k all be integrable in E. Then f(x) = d(h,k3E3x) exists 
in ES except for a set X of x of k-variation 0. If we put f = 0 in X, 
then fk and |f|k are integrable in E with 


A 


[_ Iflak < v(hse). 

E 

Proof: Follow the proof of Theorem 16.2, making obvious changes. 

Theorem 16.5: (16.10) If h is k-AC* (k-ACG*) relative to X and E, where 


P c á 
E is an elementary set, h(1,x) and k(1,x) are defined for E”, and k is 
continuous in EC and \B* in X, then h is VB*(VBG*) in X. 
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(16.11) In one dimension, tf h(1,x),k(1,x) are defined for the closure 
EC of the elementary set E, and tf h ts integrable in E and AC* relative 
to k, where k 2 0 ts integrable (and so VB*) in E and continuous in as 


then the finite derivative f = d(h,k3;E;.) exists k-almost everywhere. 


(16.12) In (16.11), putting f(x) = 0 where d(h,k;E;x) does not exist, 
then 


| dh = | fdk. 
I I 


Proof: As k is continuous, V(k;I) is continuous in I and 
(16.13) V(k;E;X n I) = V(k313X) (I c E). 


Thus by (16.13) we can divide up E into a finite number of bricks for 
which V(k;E;X n I) < 6, giving V(h;E;X n I) < e, for suitable ¢ > 0, 
§ > 0, as h is k-AC*. Hence h is VB* over X. 

For (16.11) we use Theorem 16.4. We cannot replace AC* by ACG* in 
(16.11) as h has to be VB*. 

For (16.12) with h integrable to H, and X the set of k-variation 0 
where the derivative does not exist, 


V(H-fk;E) < V(H;E;X) + V(fk;E;X) + V(H-fk;E;5X). 


On the right the second V is 0 by Theorem 5.7 (5.22), in the first V,H 
is k-AC* in X and so that V is 0, and the third V is 0 by the definition 
of the derivative. Hence the result. 

Let k(I,x) > 0, defined for E°. be continuous there are integrable to 
K in the elementary set E. A function f being the limit k-almost everywhere 
of a sequence (s09) of step functions in Ec, is said to be k-measurable 
in E. Thus the conclusions of Theorems 16.1, 16.3 are that f is y- 
measurable in E and k-measurable in E, respectively. The conclusions of 
Theoreins 16.2, 16.4 are that the derivative d(H;k;E;x) is k-measurable, 
where k = u in Theorem 16.2. 

We now look at another way of defining k-measurability. For a real 
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point function f and constants a < b let 
X(f = a), X(f > a), X(f z a), X(f <a), X(f z a) Xlag f <b), ... 


denote the sets of points x for which the appropriate equality or 


inequalities hold. 


Theorem 16.6: Let k(I,x) z 0 be defined on the closure E° of the 
elementary set E, with f,a,b as above, such that k, fk and |f|k are all 
integrable over E. Let X be any one of the sets menttoned above. Then 


k.x(X3.) ts integrable to V(k3E3X) over E. 


Proof: For X(f > b) see Ex. 8.4. Then x(X(f > b);.) is the monotone 
increasing limit of x(X(f 2 bal) as j > œ, while 


x(X(f < a);.) = x(X(-f > -a);.), X(X(f < a)s.) = X(X(-f 2 -a);.), 


x(Xla < f < b);.) = min (X(X(f 2 a);.),X(X(f < b);.)) 


WA 


x(X(f = a)3.) = min (X(X(F 2 a);.),X(X(f < a)). 


The integrability follows from Theorems 8.1 and 6.3. Similarly, omitting 
xX(X(f = a);.), from the integrability of any one of the other X(X;.)k 
for an everywhere dense set of a,b, with a < b, we find the integrability 
of the others. This gives the second definition of f being k-measurable. 
Also we can say that a set X is k-measurable if X(X3.)k is integrable. 

To tie the two definitions of measurability together, we have the 


next theorem. 


Theorem 16.7: Let k(1,x) 20 be defined for E° and let f(x) 2 0 in E° 
be such that X(X(a < f < b);.)k je integrable in E for all constants a,b 
satisfying 0 <a <b, or all such constants in a set Y everywhere dense 
in [0,°). Let (a.) c Y be a strictly increasing sequence with ag =0 €Y 
and a, >% as j >, %, and take g = aj in x(a; Sf < a.) = 01525 van le 


4 
J J 
Then g < f, and as the Xk are integrable, g is integrable to 
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(16.14) ja a V(k;E;X(a; sf <a544)) 

provided that this series is convergent. If divergent for a particular 
partition using the aj the sertes ts divergent for all partitions of this 
type, and g ts not integrable. If we take a sequence of such partitions 
of [0,%) (for once, having an infinity of partition points) in which for 
each j, 454474; +0, then g +f. Making each partition a refinement of 
the previous partition, the corresponding g are monotone increasing. If 
the integrals of the g all exist and the set of them is bounded with upper 


bound G then f itself is integrable with integral G. 


Proof: See Theorem 8.2. 


Note that if in (16.14) we omit the terms after j = N-1, then ay = A 
(fixed) and if 4444785 +0 (all j) we have the original definition of 
Lebesgue for the integral of functions bounded by 0 and A, except that 
Lebesgue uses measure instead of variation. It is just as easy to use the 
limit of the infinite series (16.14) for the integral of non-negative 
functions f. 

There is another definition of the measurability of sets. As we are 
keeping k and E fixed and varying X alone, for simplicity we can write V(X) 
for V(k;E;X). Then by Theorem 5.6 


(16.15) V(Y) < V(X n Y) + V(X), 
for all sets X,Y in a If for fixed X and every Y, 
(16.16) V(Y) = V(X nY) + VON), 
then we say that X ts a (Caratheodory) k-measurable set. 
Similarly we can look at the variation set of Section 7, writing VS(X) 
for the set VS°(k;E;X) which lies in VS°(k;E). If for some positive 


function § on ES. VS(k383E)° is compact, then by Theorem 7.6, 


(16.17)  VS(Y) c VS(X n Y) + VS(YXX), 
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where the right side is the collection of x+y for all x € VS(X n Y), all 
y € VS(YXX). If 


(16.18) VS(Y) = VS(X n Y) + VS(YXX), 


for fixed X and all Y, then X is called (Carathéodory) k-measurable 
(using variation sets). Note the similarity between (16.15) and (16.17), 
between (16.16) and (16.18). This similarity carries through to proofs 
of some results of both series. 

For the variation sets we assume (16.19), it is a simpler assumption 
than one having a different compact set for each X involved, though the 
second is more general. 

(16.19) For some positive function § on E let VS(k383E)° be compact. 
Theorem 16.8: If X is the union of a finite or infinite sequence (X5) 
of mutually disjoint k-measurable sets, then X ts k-measurable and for 


each set Yc R 


(16.20) V(Y) = 


1 ces 


V(Y nX.) + VY SX). 
j J 


If (16.19) holds, we can put VS wherever V occurs in (16.20). 


Proof: We use induction. For all Y c R” let 


p 
‘ J), ZS U Xe JZ 
v(Ya X) + V(Y p? př 


uo 


(16.21) V(Y) = 


j=l 


k-measurable. Replacing Y by Y N Zo we get 


imo 


(16.22) W(YAZ,) = 


V(Y Nn X.) 
j j 


1 


so that in particular Z is k-measurable (p = 1,...3q). Since Koel is 
k-measurable and disjoint from the k-measurable Ly 


| 
157 | 
| 


)= Yn Xg) +V(Ya ZX gat) 


uz Irs VY n xX 


ne 7 g + V(Y A Za $ VOYSZ 1) 


From (16.22) for p = q we have for p = q+1 the equality in (16.21) and 
so (16.22), and then Zoe is k-measurable. Thus (16.21) is true for 
each finite sequence, and true for all integers q when the original 
sequence is infinite, which we now assume. 

Up to this point we can replace V by VS, the proof is the same. Then 


\ 


V(YSX) < VESZ) 


1M oo 


VY n X) + VONK) s (Y) = z v(Y n x;) 
+ V(YXX), 
Letting p + œ we have (16.20), and by Theorem 5.6 X is k-measurable since 
Y(Y NX) +V(YNX) < 3 V(Y n X;) + V(YXX) = V(Y). 


To finish the proof for VS, 


p 
vs(Y NX) e vs(Y 5 Z)» Z vs(YN X5) + VS(Y x X) c VS(Y), 
j=] 
<% 
Z VS(Y n X) + VS(Y ~ X) c VS(Y). 
j=1 


As VS(Y) is closed it contains the closure of the left side, which is the 
right side of (16.20) as VS(Y ~ X) is closed, using (16.19). Hence the 
theorem for VS. 


If Yc X with Y k-measurable and X arbitrary, then from the Carathéodory 


definition, 


V(Y) + V(X NY) = V(Y NX) + V(X Y) = V(X), 


(16.22) V(X SY) = V(X) - V(Y). 
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But there seems to be no sensible subtraction operation for VS, and 
vS(X ~ Y) + VS(Y) = VS(X) (Yex) 


does not always give the value of VS(X ~ Y). However, we can prove 
k-measurability. 


Theorem 16.9: If Y c X with X,Y k-measurable, then X~ Y ts k-measurabte. 


Proof: For V we have to show that if 


ZN(XNY),Q=ZN(XNY)=ZN (KU Y), V(P)+V(Q) 


(16.23) P 


V(P u Q) = V(Z). 


To this end we put 


Q,=Q9NY, Q = OV, EY, Q EX, V(P)+V(Q) = v(P)+V(Q, )#V(Q5). 


Noting that 


PeX, Qe (KUY) n NY =K AW EX, V(P)+V(Q,) = W(P u 09). 
To obtain (16.23) we need only note that 
P=ZNXNWerr, PUQcrY, Q SY, WPUQ,)+V(Q,) = V(P UQ). 


As the proof only uses addition the result for VS follows in exactly the 
same steps. 

In the definition of k-measurability of a set X using V or VS, X and 
~X appear symmetrically, so that if X is k-measurable, so is XX. This 
with Theorems 16.8, 16.9, shows that the family M of k-measurable set 
contains xX if X € M; if (X5) cM is a mutually disjoint sequence with 
union X, then X € M; if Y c X with X,Y € M, then XY E€ M. We say that 
such a family is countably additive if it satisfies the first two results 
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omitting the XXY result, and including that the empty set is in the family. 


Here it is trivial that the given M contains the empty set. 

If the V are replaced by the values of a measure, (16.14) gives a 
Lebesgue-type definition of the integral as the mesh of the partition 
tends to 0. Then Theorem 3.4 shows that the integral produced is included 
in the generalized Riemann integrals, which links up the last definitions 
of k-measurability using V with the earlier ones. There is probably a 
much simpler proof of the link. 

Finally we can give a result linked with Theorem 11.1, and a small 
point in the proof of Lemma 10.1. 


Theorem 16.10: The function M of Theorem 11.1 can be k-measurable. If 
(11.1) ts true, tf f(x) 2 0 for all j,x, and if the compact set S is 
fixed, the k-measurabtlity of M implies that (11.2) is true. 


Proof: As fik is integrable, f, is k-measurable. (In dimension n > 1 
we have to take k = y.) Hence so are f and farf Hence we can take M 
k-measurable. Conversely, with the given conditions, let p,(x) be the 
indicator of the set of x where M(x) < j. As M is k-measurable, so is 


Pye and as fa is integrable over E and 0 < fB < fj» then F 5? is 
integrable. Further, f.p. > f everywhere as j >œ. If m is a positive 
integer valued function then m(x) < M(x) gives Pin(x) &*? = 0, and 

m(x) = M(x) gives Pin(x) *) =, As f} 2 0 and S c [A,B] for finite A,B, 
then for all -fine divisions D of E, (11.4) gives 


0< (0) E falx) CP (x) Kx) <s Bis 


m 
and F 5p, satisfies (11.4) with no restrictions on the positive integer 
valued m. Thus f is integrable using Theorem 9.1, last part, which 
proves Theorem 16.10. 

Also, in Lemma 10.1 each fj is m-measurable by Theorem 16.1, in the 
sense that f; is the limit almost everywhere of a sequence of step 
functions, so that the limit function f is also m-measurable, and Yo is 
m-measurable and satisfies Carathéodory's condition on Yo and “ys giving 
what is required. 
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Ex. 16.1. If h is the function of Ex. 15.3 then the ratios h(1)/y,(I) and 
u(1)/(u(1)+h(1)) are bounded, but the corresponding derivatives do not 
exist anywhere, showing that the proof of Theorem 16.4 needs the 
integrability of h, |h], k, or something similar. 


- -1 2 
Ex. 16.2. For 0 <x <1 let f(x) = y(log.y) 3/2 (y = x) with f(0) = 0. 
Show that for 


f-etx 3, | fdu = 2(10g.2)~2 -2(10g (t71)? ‘ 2(1og.2)"* 
[t,3) ẹ 


as t > 0+, 


so that fu is integrable over [0,3) to 2(10g,2)"? by the Cauchy limit, 


Theorem 5.14. Show that s (2771) = 2P.2(plog,2) 2 so that if g 
majorizes the sequence (s(x), g is not integrable over [0,3). 


Ex. 16.3. Let F be a closed set in R” so that ~F is open, and let G(c) 
be the open union of spheres with radius £ > 0 and centre each point of 
F. Show that as e > 0, ~G(e) > ~F, by noting that round each point of 
\F there is a sphere free from points of F. For E the usual elementary 
set anda set Y c ae prove that 


V(k;E;Y n F) + V(ksEsYSG(1/j)) < W(Y), 


by considering (I,x) in G(1/(3j)) and in a union of spheres with radius 
1/(3j) and centres the points of ~G(1/j). Go on to prove that F is 


k-measurable by showing that the second V tends to V(k;E;Y Ē~ F) as j >>. 


Thus prove that G.-sets are k-measurable. The outer k-measure k*(X) of 
an arbitrary set X c EC is the smallest V(k;E;H) of all Gs-sets H > X. 
Show that 


(16.24) V(k;E;X) < k*(X). 
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CHAPTER 6 
CARTESIAN PRODUCTS AND THE FUBINI AND TONELLI THEOREMS 


17 Fubini-type Theorems 


Let m,n be positive integers with sum N. Then the co-ordinates 
(xj... oXy) of points in RN can be separated into two collections, one of 
m co-ordinates and one of n co-ordinates. The first collection can be any 
m of the N, provided that we take the same co-ordinate of each point in 
RN, and then we can rearrange the co-ordinates so that the chosen m come 
first, (xate The second collection then becomes (Xatt? XN n 
in number. Let X c R", Y c R. Then Z = X x Y denotes the set in RN 
formed of all (x,y) with x € X and y € Y, and set X x Y is called the 


Cartesian product of X and Y. Thus R” x R? = RN. We can define bricks 


a aUi I and elementary sets Er, A E in R”, ao RN respectively, and 
can define the integration process in the three spaces in the way of this 
book. 

If E = E” x E" we have two kinds of results, respectively associated 
with the names of Fubini and Tonelli. In the first kind we integrate 
over E and find what this implies for integrals over Ee” and E” separately. 
In the second kind, without looking at E we consider when can E" and E” 
be interchanged, leaving the value of the double integral unaltered. As 
usual we use the Pythagorean distance in RP (p = m,n,N), the square root 
of the sum of squares of co-ordinates. For the first kind of result we 
find connections between divisions of E and divisions of E” and En, the 
idea behind the construction occurring in lectures on Lebesgue integration 
by J.C. Burkill and based on a paper by W.H. Young; but the construction 
is vastly different from that in Lebesgue integration. 


l@mma 17.1: Given the above definitions let § be a positive function on 
cee oe. Tedeh point x € EM tth 84x) = 272 8(x,y), let there 
correspond a §,,-fine division D(x) of E”. Then there is a positive function 
bo on E™ such that if (1",x) ts S9-fine and (1",y) € D(x), then 

am x 1,(x,y)), written i x ayj is §-fine. 
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Proof: Let D(x) consist of (1. y3) (1 <j <r) and define 
S (x) = min 27? alx, yl) > 0. 
1sjsr 


If (I™x) is 65-fine and if (I™,y?) € D(x) then I" x I™ Ties in the N- 
dimensional sphere with centre (x,y?) and radius 


Hbp(x)® + a Slayt)? < Vly)? + aalay) = 80y). 
Thus (I™,x) x (IM y9) is 6-fine. 


Theorem 17.2: (Fubini-type) Let the real or complex valued k(I™,x), 


mon 
q(x31"y) be defined in EP (p = m,n, respectively). If for 1=1 xI, 


z = (x,y), 

(17.1) h(d,z) = h(I" x IP,(x,y)) = KCI x)q(xsI",y) 

is integrable to H over the partial sets of E, then 
ao = fp dalast™) 


exists except for the x € X" with V(k;E™;x™) = 0. If we put Q(x) = 0 in 
xX”, the integral of Q(x)k(1™,x) exists in E" with value H(E), so that 


il 


(17.2) H(E) f d(kq) = E iin dq(x31",y)} dk(1™,x). 


Note that (17.1) is unsymmetrical. If it also holds when x,y are inter- 
changed, then for all EE S if k(1™,x) #0, 


n 
k(I™,x)q(xsI",y) = k,(1" yay (ysI" 5x)» q(x3I.y) 


= ky (Iya, (y; 1™",x)/k(1™,x). 


1 


In the last equation the left side does not depend on 1". Hence 
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a, (y3 1™,x)/k(1",x) 


is a function, say f(x,y), of x,y only, unless k,(I",y) = 0. Hence we 
have 


(17.3)  h(1,2) = f(x,y)k(I",x)k 0"), 
which is also true when k = 0 or ky = 0, or both. (17.3) is a more 
traditional symmetric form, but the unsymmetrical form has been used, see 


Section 20. 


Proof of theorem: Given ¢ > 0, let the positive function & on E° be such 
that 


(17.4) (p)£th(1,z) - H(I)| < € 
for all 6-fine divisions D of E, which is possible by Theorem 5.3. By 


Lemma 17.1 there are suitable D(x) dividing E", and a positive function 


S% for Pe, such that the conclusion of Lemma 17.1 holds. Let D” be a 


-fine division of E. Then 
(17.5) (o™) z [k(1",x) (0 (x)) Zq(xsI™,y) - HIT x E")| < e. 


Let X™ be the set of x where for all positive functions 6 on E° there 
are at least two 27Żs(x,y)-fine divisions D,°(x), 1,” (x) of E” for which 


(17.6) |(D,(x)) za = (03"(x)) Z ql > 2i, 


Using D,"(x) Lemma 17.1 gives a positive function as (j = 1,2) and we can 
take 6, = min (6,', 69°). Thus from (17.5), (17.6), 


(oT) zik(1™,x) (XI x) 27 < 2e, V(k36 E"; x) < 2+, 
2 


As e > 0 is arbitrary, and by Theorem 5.6, 
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K", 


(17.7) W(k3E™3X™) = 0, V(k;E™;x™) = 0, X 


= uU 
j= 
If x e~ X" then for each integer j there is a 6; (x,y) > 0 such that, by 
(17.4), 


(17.8) (o)z}h(1,z) - HCD) < 49, 
(17.9) |(0,"00)) za - (D,"(x)) z al < ad (x e~ XM) 


for all 8.-fine pi and all 23 6.(x,y)-fine pairs of divisions 0,°(x), 

2," x) of E", so that (0"(x)) zq is fundamental (E). By Theorem 4.3, q 
is integrable over E", say to Q(x). In (17.5) we now have to replace 
(o"(x)) zq by its limit Q(x) to finish the proof. Replacing 5; by 

min (54 28p+++985) > 0, we can assume that ô} is monotone decreasing in j. 


7 mage A P 
For x e~ X" let D(x) be a 2° 6 ;-fine division of E” with 


f(x) = (OM (x)) z q(xsI",y). 


By 


(17.9) with ô) monotone decreasing in j so that D(x) is 24 6,-fine 
(reid 


), 
(17.10) e60 FOO] 29 (roads Ifj 00] ¢ ZF Ox eX"), 


Replacing D'(x) in (17.5) with e = 479, by p(x) and by oS (yy with 
6, to fit in with these divisions, and 0” any 65-fine division of pM 

-j 
(17.11) EIK] [F544 (4) - #5000] < 2.4 ~. 


From (17.11), with x™P the set in xx" where 


i -j-p 
IF 544) F500) | >2 s 


2i-PykE; KIP) 5 Vek lfp Fj 1E") 5 2.4%, 


8 


. =t mj 
(17.12)  V(k;E; XIP) < 2179*P, v(k;E";Y"P) < 214P (yP = v, x™P), 
J= 
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Using (17.7) and putting 
s(x) = 2717P e YMA cy y Ê yey ye (PrN), 


(17.13) WsksE™3z) < 1, Z= u Puw, 
p=1 


If x & Z then x & X™P for any j,p, and so 


(17.14) 


I 
oO 
v 


(x) -f 00] < APON iph. Igt -fl 


(Tiai ja! 


for x €& Zand all j. By (17.10), given e > 0, in ZX" we can choose 
the integer t = t(x) so large that 


(17.15) Ifig - Q(x)| < s(x)e, 


while t(x) is 274s .(x,y)-fine. By Lemma 17.1 we can find suitable ô,, 
D™ to fit with 0" (x) (x E& Zand x € X™), prt) iy) (x € Zk"), shan’ by 
(17.13), (17.14), (17.15) the replacement can take place. 

The form (17.1) seems to be the most general for which the conclusion 
of Theorem 17.2 can be proved. But, assuming a little more, we can deal 
with general h. 


m n 
x E , as 


Theorem 17.3: Let I= I"x I" be the bricks in RN, and E = E 
before. For z = (x,y) and h(1,z) defined for EC, let h be integrable over 
E to H, and for each fixed (1x) let h(I" x 1",(x,y)) be integrable 
relative to (1".y) over E” to ott" al. Given £ > 0 let there be a 
k(1™,x) defined for E™ and of variation less than € over E", and a 
positive function 8" on ie , such that 


(17.16) |@(I™,x) - (0) z hO" x 1, (xsy))] < [k(1™,x)| 


for all 6"-fine divisions D” of E”. Then there exists 
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(17.17) | de(1™,x) | 3 Hi y dh(1™ x 1",(x,y))} = | dh = H. 
em X dE JE 


Proof: There is a positive function S™ on E™ such that 


(17.18) (0) x|k(1™,x)| < V(k;E") + e < 2e, 


for all -fine divisions D” of E”. There is a positive function 6 on ES 
such that (17.4) is true. For each x € E"? Jet 


8,M(ysx) = min(2"*6(x.y)s S'yD) > 0 


and let 0} N(x) be a ô} N fine division of E”. Let ig be a positive 
function on ETC that corresponds to the various 0," (x) and the 6, as in 
Lemma 17.1 and let 


moo. m m 
& = min (ô, 5 6). ris 


Then a 6,"-fine division i," of E satisfies 


A 


|(0,") zG - H| |(0,") zG - (0,") £(0,") zh] + |(0 1")z(0,") Eh-H 


A 


(0,") r|k| +e 


by (17.16) and (17.4), since the division of E eave teat of the Cartesian 
products of the fre 0," with the (I",y) € i , is 6-fine by Lemma 
17.1, and then by (17.18), 


\(0,") ZG - H| < 3e, 


1 


proving (17.17). 

If, for a given a function 6" , we define hi mx) | as being the 
left side of (17.16) with oa -fine division of E", and if (17.17) is 
true, then the variation of k is arbitrarily small by (17.4). 


Ex. 17.1: Let f(x) be a point function on the real line and let H be a 
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finitely additive function of rectangles I x J for intervals I and J on 
the real line. Show that if fH is integrable on T = [a,b) x [u,v), 


le fdH = a is f(x)dH(I x [u,v)). 


Proof: Let 6 be a positive function on T° such that for all 6-fine 
divisions D over T, 


IO) EFOOH x 3) = [| tanl <e. 


Let D"(x) be a 2725(x,y)-fine division of [u,v) for each x € [a,b], with 
ô% > 0 on [a,b] as in Lemma 17.1, and o a ôp-fine division of [a,b) 
(m=1 =n). Then 


(0) z f(x) (0 (x)) EHC x J) - {| fdH| <e, (0"(x) )ZH(I xJ) 
JT 
=H 


(I x [u,v)). 


Ex. 17.2: Let h(x,d) be finitely additive in J c [u,v), with k = k(I), 
I c [a,b), where the brick-point pair in RÊ is (I,x) x (J, y). If hk is 
integrable over [a,b) x [u,v), show that the integral is equal to 


| h(xsfu,v) )dk(1). 
{a,b 


Ex. 17.3: Let h,k be VB* in intervals [a,b], [u,v], respectively, on the 
real line, and let W be a set in RÊ with indicator X, with Wy the set of 

y for the fixed x, such that (x,y) € W, the section of W for the constant 
x, and with W the section of W for the constant y. If |hk|X is integrable 
over [a,b) x [u,v) = T then 


V(k3Lusv) sw, ) d|h| = | V(h;Ea,b);W,) d|k|. 


V(hk3T3W) = | 
[u,v 


3 [a,b) 
Hint: Put the variations as integrals. 


Ex. 17.4: Let f(y) and h(x3J) be given in [u,v] for all x € [a,b], with 
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k(I) VBG* in [a,b]. If fhk is integrable on T to H(T) then 


J(x) = | fdh(x3J) 
[u,v) 


exists except for x in a set X of k-variation zero. If we write 0 for 
J(x) in X then Jk is integrable over [a,b) to H(T). If also |fhk| is 
integrable on T then 


J*(x) = |f|dv(h;.) 


bass 
exists except for x in a set X of k-variation zero, and, writing 0 for F 
in X, J°V(k3.) is integrable to V(fhk;T) in [a,b). 


18 Tonelli-type Theorems And The Necessary And Sufficient Conditions 
For Reversal Of Order Of Double Integrals 


Here we have the second kind of problem, the section's title being 
good description. In symbols, taking integration over E™ to be the 
integration of Theorems 13.7, 13.8, and integration over E" to be the 
limit process Te we look at 


m ; n n 2 
(18.1) FF ifin f(x,y)dk}dh =| (Lin (0") z FLKy KCI" y) dh 


. n n 7 f(xy)dh}dk, 
Lim 0") z kU") fa f(xsy)dh P Je (xsy)dh)dk 


where D" is a general 6-fine division of E" and where lim is the limit 
60+ 
as 6 shrinks. 


Here, monotonicity does not seem relevant, but the analogue of the 
bounded Riemann sums test is very relevant and leads to the following 
theorem. 


Theorem 18.1: (18.2) For em an elementary set, h(1,x) = 0 on oe and 
f(x,y) real-valued for x € pe ye ee let (x,y)h(1",x) be integrable 
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over E" for each fixed y E ee. so that tf 0" is a fixed division of E", 


the f(x,y) of Theorems 13.7, 13.8 is replaced by 
(18.3) (0") zf(x,y)k(I",y) 


which when multiplied by h(I",x) ts integrable over E", We also assume 
that (18.3) is integrable over E" for each fixed x € pS, 


(18.4) Let 0” be a positive function on ETC, Zet B < C be wo real numbers, 
and let 


B < (0) 5 ¢(0(x)) cF(x,y)k(I",y)3H(I™,x) < C 


for all -fine divisions 0" of E" and all collections of divisions D(x) 
of E”. Then (18.1) holds. 


Proof: The step function approximation of the integral over En, from 
Theorems 16.1, 16.3, needs the limitation that k is continuous or even the 
volume function, until these theorems can be improved. However, for a 


step function sp) if D(x) is a refinement of the partition using the 
brick "steps", the sum over D"(x) is equal to 


Jon Pay Idk, 
E 
and (18.4) simplifies to 
Bs (0) ch(I",x) | a f(xy)dh < € 
JE 
which is true for (18.4) as it stands. We now use Theorem 9.1 to finish 
the proof. 


We now come to the necessary and sufficient conditions for (18.1). 


Theorem 18.2: Let h(1™,x) and k(I",y) be defined and VBG*, respectively 
for Ee. ge with (18.2). In order that 
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(18.5) fim E fdk}dh = F 


exists it is necessary and sufficient that for some compact set S of 
arbitrarily small diameter, some positive function p" on EMC 5 some positive 
function 6" (y;x) on ENC for each x in gm wit 3 -fine divisions 0" of em 
and all §"(y3x)-fine divisions D(x) of E”, 


(18.6) (0™) zh(1™,x)(0"(x)) 2 f(x,y)k(",y) € S. 


Given (18.6), a necessary and sufficient condition that (18.1) holds is 
that for each e > 0 there is a positive function 8" on E'S such that for 
all 6" -fine divisions p” of EB", a positive function é"(xsy) on gue and 
depending on 8", all 6"(xsy)-fine divisions D"(y) of E", and a compact 
set T of diameter < £ such that S N T ts not empty, with 


(18.7) (0") s{(0™(y)) z #(xsy)h(I™,x)}K(I",y) € T. 


Proof: If (18.5) exists then, given e > 0, there is a positive function 


a on p" with 
(18.8) Fee < (0") zh(1™,x) | n fk < F +e. 
E 


As h is VBG* there is a positive function s(x) such that sh is VB* 
(Theorem 5.9) so that we can also assume that for some constant M> 0, 


(18.9) (D) zs |h] sM. 


Also there is a positive function S” (y,x) on EPC such that for each 6"-fine 
division D"(x) of E”, 


(18.10) |(0(x)) z f(x,y)k(I”,y) - | n fdk] < es(x). 
E 
Putting together (18.8), (18.9), (18.10) we have (18.6) with S = 


[F - e- eM, F+e+eM]. Conversely, if (18.6) is true, then as S"(y,x) +0 


we have 
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m m 
(D) Eh(I ,x) [s fdk E€ S | 


and (18.5) exists as S has arbitrarily smal] diameter. Next, if 


(0°) Ek(I",y) | m f(xsy)dh = P ((0") 2 F(x,y)k(1" sy) dh 


E 


tends to F as 6” > 0 on E'S with é"-fine D" then, given £ > 0, 


Fee < | m A0) EF xsy)K(I"y)} dh < F + e, 
E 


and using the analogue of (18.10), for a suitable positive function 
s™(x;y) on E™ and all 8"(x3y)-fine divisions oy) of EM, 


F -e< (D°) ck(I,y)(O™(y)) of (xsy)h(I™,x) < F +e. 


Thus we can take T = [F - e, F + e], of diameter 2e, while S and T have 


the point F in common. 


Conversely, let (x;y) > 0+ such that the inner sum tends to 


| ñi fdh. Then 
E 
(0?) sk(1",y) | fdh eT. 
E 


As T has an arbitrarily small diameter, the last sum tends to a limit 


Fe = le (lon fdh}dk . 


As S,T always have points in common, F* = F and we have proved the result. 

Let us now take integration over E" to be the integration of Theorems 
13.7, 13.8, and integration over E" to be the limit process lim, i.e. let 
us interchange the roles of the two integration processes. Then it is easy 
to see that the conditions (18.6) and (18.7) interchange, which gives a 
check of the theory. 

We have replaced f(x,y) in the general theory of Section 13 by (18.3). 
If we do the reverse operation in Section 17, replacing expressions like 
(18.3) by f(x,y) or by fa) we obtain an analogue of the double integral, 
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involving sequences of integrals. 

For 4 = llor let F5k be integrable over the elementary set E. 
If there is a number F such that, given s > 0, there are an integer J 
and positive functions 5; on EF with 


(18.11) F-e< (D) fjg) OOH) <Fre 


whenever the integer-valued function j(x) 2 J and Dis a S; (,) "Fine 
division of E, then we say that F 5 (x) OOH) is integrable to F over E. 


Theorem 18.3: If h zs VBG* in E° and for each fixed integer j, F sh ts 
integrable in E and if F5(y) Ooh ts integrable over E, then for some set 
X of variation zero and for x the indicator of XX, FAX satisfies (11.1), 
(11.2), (11.3), except that h ts VBG*, not VB*. 


Proof: Writing F as F(E), by the usual proofs in Section 5 the integral 
F(E,) exists over every partial set Ey of E, and is finitely additive, 
and for every j(x) = J and every 6; (x)-fine division D of E, 


(18.12) (0) 5 IF 5 (x) hx) - F(I)| < 8e. 


If q(x) is another integer-valued function on Ec with q(x) 2 J, if 


ô = min (Sig) 5g (x) (x)) and if D is a 6-fine division of E, then 


Let Xn G E° be the set of all x in E° having the property that, given J, 
there are two integers j,q in q > j 2 J and depending on x, such that 


(18.14) |f; (x) - F(x) | 4 Afr (6 = 1525205) 


writing j = j(x), q = q(x), if X is the indicator of Xps and X the union 
of the X,» for every 6-fine division D of E, (18.13), (18.14) give 


(D)z|h(I,x)|xX s 16re, V(h3E3X,) < 16re. 
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| 
| 


This is true for every J, so that we can let e + 0+ and have 
V(h3E3X,) = 0 = V(h3E3X). 
If x €\ X, then given the integer r > 0, there is an integer J. such that 


if ;(«) - f (x)| < 1/r (all q> j 2 d),) 


q r 


and (f.(x)) is a fundamental and so convergent sequence in j, with limit 
f(x), say. Thus if x* is the indicator of xX we can replace f, by fjx*, 
leaving integrability unaltered, together with the integrals, and the new 
f. > f everywhere, and fh is integrable on E to F. As h is VBG* there is 
a function s(x) > 0 such that V(sh;E) < 1. Given e > 0, there is an 
integer-valued function p 2 J on EC such that 


Ifo) > FOO] < es(x). 


p(x 


For Da 55 (x) Fine division of E, (18.11) gives 
|\(D) £f(x)h(1,x) -F | <e + €(D) & [sh] < 3c. 


Finally, taking j(x) constant at j 2 J in (18.11) with D 6;-fine, and then 
by the definition of the integral F, of f jh over E using a 6.*, we can 
replace ô} by 85° = min (8556,*) and let D be 6,°-fine, so that (18.11) 

is true and 


(0) 2F;00h(1x) = Fy] <e |F) -F| <2 (j2 J) 


implying that Fy > Fas j> œ, This completes the proof. 

Note that (18.11) and (11.4) are effectively the same. However, for 
(18.11) we assume j(x) 2 J, where J is a constant, and for (11.4) we 
assume j(x) = M(x), and M(x) need not be constant. 


3 


Ex. 18.1: Let f(x,y) = 2(x-y)(x+y) ~ (x2 0, y z 0, x+y > 0), f(0,0) = 0. 
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Then as the calculus integral is a generalized Riemann integral, keeping 


y fixed, 


| f(x,y)du = [-227! + aye) (z = x+y), = -2(tay)*, 
[0,1 


Integrating this relative to y over (0,1) gives -1. Further, noting that 
f(x,y) = -f(ysx), integrating over y first and then x, we get 1. Thus 
this example cannot satisfy the necessary and sufficient conditions of 
Theorem 18.2, and it cannot satisfy the conditions of Theorem 17.25 SO 
that we cannot integrate it, multiplied by the volume function, over 
(0,1) x (0,1). Can these two results be proved directly? Let x and y 
lie in (0,1), and let L(x,y) be the main square, (0,1) x [0,1) minus the 
rectangle [0,x) x [0,y), so that 


L(x,y) = Lasy) ú {Cx,1) x [0,y)}. 


Show that the integral over L(x,y) is (y-x) (yx) 7), If x > 0 first, and 
then y > 0, the limit is 1, while if y > 0 first, and then x + 0, the 


limit is -1. 


Ex. 18.2: In Rê let f(x,y) = x/y when y > 0, and f(x,0) = 0. Show that 


there exists 


f(x,y)dx}dy, but that | f(x,y)dx}dy 


baa Ae sa [0,1) 


does not exist, so that the repeated integral does not have one property 
of the integral in RÊ, Does this imply anything about the latter integral? 
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CHAPTER 7 
APPLICATIONS 


19 Ordinary Differential Equations 


Given an (n+1)-dimensional open set G of points (x,y), x being a real 
number and y an n-dimensional vector, let (a,b) € G. We study differential 
equations of the type 


(19.1) dy/dx = g(x,y), 


where g is a function whose values are finite n-dimensional vectors. We 
study the existence of a solution y = f(x) with f(a) = b, such that 


(x,y) E G when |x-a| sas for some a > 0. We can assume that x 2 a as the 
same methods can be used when x < a. The (generalized Riemann) integral 
integrates all finite-valued derivatives, so that we can integrate (19.1) 


to obtain 


(19.2) y -b= Í ) eya 
a,x 


where by dx we mean du with u([u,v),x) = v-u. Conversely we obtain (19.1) 
(at least, almost everywhere) on differentiating (19.2). 

In (19.2) we can relax the conditions on g slightly, and g(x,f(x)) can 
behave badly in a set of x of variation zero without altering (19.2), 
whatever conditions are later imposed on g. 

Beginning trivially, if y =b is a solution through (a,b), then by 
(19.2) the integral of g(x,b) is 0 in some neighbourhood of a, so that 
g(x,b) itself is 0 almost everywhere in that neighbourhood. Conversely, 
if g(x,b) = 0 almost everywhere then y = b is a solution through (a,b). 
There may be other solutions through (a,b). 

Carathéodory showed that at times (19.2) has a solution, and here we 
go far beyond the cases he treated, supposing that for some a > 0 the 
method works over the range [a,a+a], writing 
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b (a <x sata/j), 

7 b + | l 
J [a,x-a/J) 

Here, fj is defined as a continuous function in (atpa/j,a+(p+1)a/j] by 
using its values in [a,a+po/j]. This is an inductive definition for 
p= 1,2,...,J-1. Of course f. is not usually a solution of (19.2) because 
of x-a/j in the integral of (19.3). Carathéodory's idea is to take a 
subsequence of j tending to infinity, for which the f; tend to a limit 
y = f, which would be a solution of (19.2) if the limit could be taken 
inside the integral and inside g. To this end he assumes that g(x,y) is 
continuous in y for almost all x, and that for a suitable modulus or norm 


g(t,f.(t))dt (ato/j<x sata). 


J+ | in R”, some integrable m(x) 2 0, all y, and almost all x, 
(19.4)  |g(xsy)| < m(x). 


Then the £5) are uniformly bounded and equicontinuous, and by Ascoli's 
theorem a subsequence is uniformly convergent on [a,a+c], and gives a 
solution. 

A more general condition for limits under the integral sign is given 
in Theorem 9.1, last part. Here the condition is as follows. For a 
compact set S in ot some positive function 6 on [a,a+a], all 6-fine 
divisions D of [a,ata), and all f : [a,ata] > R", 


(19.5) (D) Z g(x,f(x))(v-u) E S. 


If S lies in an n-dimensional cube with edge length A, then as in the 
proof of Theorem 9.4,(19.5) gives 


(19.6) (D) E[g(x,f(x)) - g(x,b)| (v-u) < 2nA. 


In this we put the f, of (19.3) for f and take 6*-fine divisions on each 
[u-a/j,v-a/j) with (Lu,v),x) € D and let 6* shrink suitably. Thus 
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(19.7) (0) Ele stv) -fyt - | g(t,b)dt| 
J * [u-a/j,v-a/j) 


= oif {g(t.f,(t)) - g(tsb)idt] < 2nA, 


[u-a/j,V-a/j) 


provided that one division-point of D is ata/j. Below this number, terms 


of the sum are 0. Thus 


(19.8) f,(x) - 


| q(t,b)dt 
zi [a,x-a/j) 5 


is of uniform bounded variation in x and j. As (19.8) is Oat x =a, 
Theorem 2.10 Corollary shows that a subsequence of (19.8) tends to a finite 
limit. By continuity of the integral of g(t,b), the same subsequence of 
(f.(x)) tends to a finite limit, say f(x), everywhere in [a,ato]. Assuming 
that g(x,y) is continuous in y for almost all x, the same subsequence 
of (g(x,F5(x))) tends to g(x,f(x)) almost everywhere. Then (19.5) and 
Theorem 9.1 show that for the same subsequence, the limit can be taken 
under the integral sign, y = f(x) is a solution of (19.2), and f is 
continuous. 

We have thus proved the following result. 


Theorem 19.1: For a compact set S in RS some positive funetion § on 
[a,a+a], all -fine divisions D of [a,ata), and all f:[a,a+a] > Rì’, let 
(19.5) be true, and let g(x,y) be continuous in y for almost all x. Then 
there is a continuous solution y = f(x) of (19.2) through the point (a,b) 
in grt, 

The result (9.5) in Theorem 9.1 is useful and is stronger than the 
Arzéla-Lebesgue (19.4), and is very near to the set of necessary and 
sufficient conditions of Section 11. So the question arises of whether 
(19.5) can be replaced by modifications of the necessary and sufficient 
conditions of Theorems 11.1, 11.2. We could replace the f.(x) of these 


theorems by g(x,f,, (x)) with f(x) given by (19.3) and with (r;) an arbitrary 
subsequence of the positive integers, chosen so that (E is convergent 


everywhere, if this is possible. These seem to be the widest conditions 
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for which a Carathéodory-type proof works using (19.3). However, it 
seems that the resulting modifications of (11.4), (11.9) would be too 
complicated and impractical, though they could easily be written if ever 
needed. 

There are simple solvable differential equations that are not 
covered by the above proofs, for example, g(x,y) = g(x)/h(y) where 
h : R?” > R and h(y) > as y tends to some vector b, where g(x) is 
integrable in some neighbourhood of x = a, and where h(y) ig integrable 
in some neighbourhood of y = b. Then (19.1) becomes 


h(y)dy/dx = g(x), | h(y)dy = | g(xdex 


in the respective neighbourhoods of b and a, by integration by substitution 
(Theorem 5.7 (5.25)) on each co-ordinate of y separately. For example, 


hlyjse Yp) = (yf sea!) (p constant in -1<p<0), b=(0,...,0). 


To combine the two kinds of results let there be an integrable h(y) 
for which g(x,y)h(y) satisfies our generalization of Carathéodory's 
conditions. Then 


| h(y)dy = | h(y)(dy/dx)dx = | h(y)g(x.y)dx 


and there is a solution. 
The solution is unique if 


(19.8) — |g(x.yy) - g(xsyo)| < LCyy-¥o|) 


where L(r) is continuous in 0 < r < k and some k > 0, L(0) = 0,L(r) > 0 


< 
except at 0, and | ) dr/L(r) does not exist. Also 
[0,k 


(19.9) L(r) can be replaced by g(x)L(r) if also 4 20 is integrable 


on [a,ato). 
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20 Statistics And Probability Theory 


Applied mathematics, or natural philosophy, is the application of 
pure mathematics to the natural world. The scientist's experiments or 
observations, and data (numerical or otherwise) have to be interpreted, 
and the mathematician gives him various mathematical models to that end. 
The scientist then selects the most suitable model and the values of 
parameters, predicting from the model what his experiment should show, 
and sees whether further experiments fulfil the prediction. If not, the 
model is modified or discarded. Nothing, or almost nothing, of this can 
be exact; exactness can normally only occur in pure mathematics; experi- 
mental error occurs in various forms. For example, in the amplification 
of varying electric currents a random noise is produced by the thermal 
motion of electrons in the amplifier, and this can often be avoided by 
using a maser operating at very low temperatures. Experimental error has 
as many forms as the number of kinds of experiments performed, and it 
increases as we pass from physics to chemistry and then biology. In 
subjects such as economics, anthropology, linguistics, one cannot 
experiment, one can only collect the data as they occur, and the error is 
usually greater than that in physical systems. The study of such data, 
and the errors of experiments, has produced two more branches of applied 
mathematics, those of statistics and statistical physics. In other 
branches it is usually vital to smooth or rationalize, to obtain a 
suitable pure mathematical model. If it gives correct predictions we 
replace the experimental results by the model. This occurs in statistical 
physics, the basic model being the theory of probability. But in 
statistics, though the model is roughly the same, we cannot discard the 
raw material and use the model as this is the source of many fallacies. 
We have to keep separate the raw material and the pure mathematical model, 
being repaid by a resulting greater precision of possible statements in 
the subject. 

The first operation in statistics is that of classification. If the 
data consist of single real numbers x we can divide up their range into a 
finite number of disjoint sets for example, the sets given by 
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Xeon, -n<X<-ntt, -N+1 <x <-n+2,..., 0<xX<1,...5n-1 <X <n, XN, 


to take a very simple case. If the data are a fixed number p of real 
numbers we can represent each value as a point of RP and we can again 
divide up the space into a finite number of disjoint sets. In practice 

we cannot observe a complete infinite sequence of real numbers, but we 

may have data in which p can be arbitrarily large, for example in 
sequential sampling, in which case we use an infinite dimensional 
Cartesian space which can be divided up in a suitable way into a finite 
number of disjoint sets, by dividing the space only with respect to a 
finite number of co-ordinates and leaving the rest undivided. We can 
observe continuous functions, an early example being the traces produced 
by a barograph, and a more modern example being heart muscle action 
recorded on a cathode ray tube, and we need a space of continuous functions 
that can be divided up by, for example, taking a finite number of values 
of the independent variable or variables and classifying the values of the 
functions at those points. Some observations cannot be measured, or can 
only be measured at great inconvenience, for example, the grading by eye 
of colours or shades; here, classification becomes difficult. 

After classification we are given a finite number, say, S4 Sose ee Sh 
of disjoint sets with union the whole set of possible data values, and 
the given finite number of data values from the experiment or other 
research. We count the number Nj of times that the data values fall into 
the set Sj» for j = 1,2,...,m, the total number of values being N, the 
sum of the N.. The relative frequency of values in the set S; is defined 
to be A = N;/N, a ratio lying between 0 and 1, the sum of the relative 
frequencies for j = 1,...,m, being 1. If j # k, the relative frequency of 
values in Sj U Sk is fj + fk > so that the relative frequency is finitely 
additive. 

If we have made a proper classification and if we increase the amount 
of data, it usually happens that the n" tend to fixed values. If they do 
not, it may be an accidental oscillation or it may be that an important 
source of variation has been neglected. For example, we might think that 
a process is independent of time, but the collected data might show a rise 
in time. In the pure mathematical model of the relative frequency we 
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attach a number Pj» called the probability of Sis to the set S; Get 
...,m), that lies between 0 and 1, the sum for j = 1,...,m being 1, such 
that the probability of S; U Sk is Pj + Pk (j # k) so that the probability 
is finitely additive. The problem is to link Pj with fj » an obvious way 


being to write pj = lim ie but this cannot be given a rigid interpretation. 
N>% 


First, in practice we cannot observe all terms of an infinite sequence 
without taking an infinite time over it. Secondly, even if we could, and 
if the limit exists for one set of data, nothing ensures that the limit 
will exist for another set of the same kind of data, or that the two 
limits are the same, except trivially when fN = 0 for all N, or A =1 
for all N, and similar cases. von Mises defined a Kollektiv as an 
infinite sequence (x;) of 0's and 1's with the properties that the relative 
frequency of 0's in the first N members of the sequence tends to a limit 
as N tends to infinity, and that the same is true, with the same limit, 
for an arbitrary subsequence of the (x5). By taking the subsequence to 
consist of 1's alone, or of O's alone, it is clear that the only Kollektivs 
are those sequences for which either X= 0 for all j greater than a 
fixed J, or Xj = 1 for all j greater than a fixed J. It was not the 
intention of von Mises to restrict his Kollektivs so drastically, so that 
some restriction has to be applied to the choice of subsequence. von Mises 
made the choice without knowing the values x., but the accurate definition 
of such ideas requires probability, so that the argument is circular. 

Let us dig more deeply, writing each (x;) as a decimal in the scale 
of 2 and between 0 and 1, x = 0, ey Each number in [0,1] gives 
a sequence (x5). and different sequences (x5), (y;) can only give the same 
x when there is an integer J such that x, = y, when j < J, while one 
sequence subsequently takes the values 0, and 1 repeated, the other taking 
the values 1, and 0 repeated. These x are multiples of powers of 3, and 
their set is countable and can be neglected. Now let constants My smg have 
sum 1 and lie in (0,1). There is a continuous non-negative finitely 
additive interval function p satisfying 


p(0,1) = 1,p(2m.27], (2m+1)27}) = m,p(m.2-*", (met )2-9*!), 


p((2mt)27J, (2m2)27) = myp(m.2-9*!, (met)29*"), 
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The set Xun of all x with XK tXote es tXy = M, is a finite union of intervals 


and 


i o Nc NM M 
(20.1) V(p3£051) sXpqn) = My Mos as Wye 


say. Given e > 0, let k be the integer next lower than Nm, + Ne. Then 


for j>k 
wW, (N-j)m (N-Nm,-Ne )m 1-£/m 
l te Es eee um é, 02 e # Is 
j (j+1)m, (Nm +Ne )m, I+e/m, 
Thus if Yy is the set where Xy tives # Xy ? mN + 2eN, we use (20.1) to 
obtain 


V(p:£0,1)5¥y) = W, < Ha /(1-) < c&N-1 (4-6), 


E 
j>m,N+2eN 
V(p300,1)3 u Yy) < Eta- 
N2M 


. 


Hence the set Zo where lim sup (x4+x2 T a xy)/N > m, + 2e satisfies 
N> 


V(p3£0,1)32,) = 0. Similarly for lim inf (x, tXo + eee + Xy)/N < My-2e. 


N> 


Taking s = 1/m, m = 1,2,..., we see that, p-almost everywhere, 


(20.2) Lan (x, + Xp + vee + xy )/N = My. 

Changing mo in (0,1), we do not change the limit of a particular Kollektiv, 
but we change the sets of variation zero. There is no criterion depending 
on (x;) alone, to tell us whether the corresponding x lies in the set of 
variation zero, or whether x lies in the complement of that set, the 
criterion must depend on the particular value of m chosen, m, must be 
chosen first, and its value cannot be deduced from the behaviour of (x5). 
This is a deeper reason why we cannot define probability as the limit of 
the relative frequency, unless we choose a special m, a priori. G.A. 
Barnard has suggested that for p we use the Haar measure, which means that 
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here we take m, = 3. 

Another definition is that probability is the degree of rational 
belief, and by using various axioms a theory can be built up. But the 
resulting probability is rather subjective and there seems to be no 
method of evaluating probabilities in statistics. 

Probability can also be defined as a pure mathematical measure with 
various properties, but divorced from numerical data. 

Each of the above definitions seems to hold part but not all of the 
ethos of probability in statistics, so we use a blend of all three, 
assuming that the applied statistician's numerical work is correctly 
carried out, but perhaps wrongly interpreted. Here we concentrate on 
tests of significance. When considering a possible statistical experiment 
we usually assume that a certain effect will not occur, known as a null 
hypothesis. Using our rational belief or otherwise in the experiment, 
we assign a number § in [0,4], called the level of significance. If the 
null hypothesis is a basic belief we put § = 0 and have no need to do the 
experiment. If, however, the null hypothesis could be false, we take § 
positive, with value such that we accept a situation in which we reject 
the hypothesis, when true, in a proportion § of cases. The more costly a 
rejection is, the nearer we take § to 0. Experiments in parapsychology 
have the smallest §, we test whether we can obtain information of the 
natural world or the thoughts of an agent, or can move bodies by ways 
that do not use the known senses or sciences. Many would say that such 
is impossible, putting § = 0. Others, conceding that the effect might 
happen, would have to be thoroughly convinced first, and these put 
§ = 10710 or some still smaller positive number. 

Having chosen §, we arrange the test procedure so that when the 
nypothesis is true we reject it in a proportion § of cases, and accept it 
in a proportion 1-5 of cases, both proportions being true in the long 
run. The meaning of “in the long run" is defined mathematically, using 
probability in the pure mathematical sense. The choices of a suitable 
probability measure and of the significance level, are known as the problem 
of specification., This is often governed by the knowledge of previous 
experiments, by some theoretical argument, or by some hypothesis to be 
tested. 
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We arrange to obtain a sample of m independent values Yge Ym of 
a variable from the experiment that has not yet begun. The integer m 
may be fixed or may itself be a statistical variable. Pure mathematics 
can partially but never wholly solve the problem of ensuring that the 
values are independent, there is still much in the design of the 
experiment that has to be left to the individual assessment of the 
applied statistician. The probability measure of the variable gives 
that of the sample, and we choose a set C(6) of samples that has 1-6 for 
probability measure, where 6 is the significance level. If the variable 
is discrete this is not always possible, and then we sometimes use sets 
C(6) with probability measure greater than 1-6. For variable m, the 
C(6) depends on m, and then we also use C(n) for nin (6,1) and near to 
1, with C(6) > C(n). 

For fixed m we carry out the experiment and obtain the sample. If 
in C(6), the sample is compatible with the null hypothesis and is not 
significant, relative to Sand C(6). Otherwise the sample is tncompat- 
thle with the hypothesis, and is significant, and the null hypothesis is 
rejected. For variable m, if the sample is in C(n), the sample is 
compatible with the hypothesis and is not significant, relative to nand 
C(n), and the null hypothesis is accepted. If the sample is not in C( 6) 
it is incompatible with the hypothesis and is significant relative to 6 
and C(6), and the null hypothesis is rejected. If the sample is in C(6) 
but not in C(n) we take a further value Yna and retest. Sometimes there 
is a ceiling M beyond which m is not allowed to go. 

The next idea is that of independence. If a variable x ranges over 
a set X and a variable y over Y, then (x,y) ranges over X x Y. Supposing 
that there is a probability measure p, over sets in X, a Py over sets of 
Y, and p over sets of X x Y, if 


p,(I)py(d) = pI xa) 
for all bricks I € X, J CY, we say that x and y are independent. Strictly, 
this is not a property of the variables x,y, but of the models PysPy>P 
that we are using. This property is often tested by using contingency 
tables, and is assumed when we take a sample of m independent values. If 
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for N independent samples the sample is in C(6) exactly M times, we use samples satisfies 

independence and the property of combinations as in the binomial theorem, 

to show that the probability measure of this is Noy (1-6) "aN, Thus as in V(p3S3U) = V(p;S;U(r)), V(psS;UNU(r)) > 0, 
the proof of (20.2) we see that M/N > 1-6 as N > œ, except for a set of 
samples of probability measure 0. This gives a pure mathematical 
definition of the statement that M/N tends to the right value in the long 
run, and it is the simplest of such convergence results, deeper results 


then 


V(q3S3U) > V(q3S3U(r)). 


Proof: Using (20.3), and the variations as integrals, if (20.4) holds 


also being true. 
for all s € U, a measurable set, then 


There are many regions C(é), so the question arises, which is the 
best? Our construction can be described by saying that we commit an 
error of the first kind when we reject the null hypothesis when it is 
true, and we fix the probability measure of the error at 6. We also 
commit an error of the second kind if we accept the null hypothesis when 
it is false, and we wish to minimize the probability measure of errors 
of the second kind. These two probability measures are not the same, the 
former being p, say, and the latter being an alternative probability 
measure, say q. Note that we can only test p against a single function 
q using these ideas, and so only one alternative hypothesis, unless 
several hypotheses give rise to the same q. 

When p,q are continuous in one dimension, they are non-negative and 
clearly AC* with respect to p+q, and then by Theorem 16.5 (16.11), (16.12) 
there exist f,g 2 0 such that 


V(q3S3U) < rV(p3S3U). 
If instead, for all s € U, a measurable set, 
(20.5) g(s) > rf(s) 


then either V(q;S;U) > rV(p;S;U), or there is equality, in which case U 
has p-variation zero. For if X is the indicator of U, 


ls x- (g-rf)d(p+q) = 0, X+(g-rf) = 0 
(p+q)-almost everywhere and 


(20.3) p= | fdtpidls 4 < | gd(p+q): 0 s V(p3S3U) < V(p+q3S3U) = 0. 
Now sets U(r)ṣU and UNU(r) are measurable and are in regions where s 


In dimension n > 1 we have not proved a similar result, because of the 
satisfies (20.4) and (20.5), respectively. Hence 


rigid geometrical proof of Vitali's theorem, but we can assume the 
existence of f,g, as (20.3) can be proved in another way. 

V(q3S3UNU(r)) > rV(p;S;UU(r)) = rV(p3S3U) - rV(p3S3U n U(r)) 
Theorem 20.1: Zet S be the set of all samples, supposed a finite or 


infinite brick, and U(r) the set of samples s satisfying = rV(p3S3U(r)) - rV(p3S3U N U(r)) = rV(p;S;U(r)xU) 2V(q3S3U(r)SU), 


(20.4) g(s) < rf(s), V(q3S3U) = V(qsS3UNU(r)) +V(qsSsUNU(r)) >V(q3S3U NU(r)) +V(q3S5U(r)sU) 


for r a real constant. If U(r) is measurable and if a measurable set U of = V(q3S3U(r)). 
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Thus if 1-V(p;S;U(r)) is used as a significance level, U(r) is the 
best possible set to distinguish between p and q. If g = rf for some 
constant r and in some measurable set U of positive p-variation, then 
V(p3S;U(r)) is discontinuous at that r. Otherwise we can find an r to 
give the required significance level. 

If there is more than one alternative probability measure q, the 
best possible region U(r) for one q might not be the best for another q, 
so that we have to make a compromise. But here we do not go further into 
such details. 

Various integrals are associated with the probability measure p of a 
variable x, of the form j f(x)dp, where T is the finite or infinite brick 


T 
over which x ranges. f(x) = x gives the mean y, and for f(x) = ea we 


have the variance, the square root being the standard deviation. For a 
constant t, f(x) = xt gives the tth. moment, and for various values of t, 
the generating function of p. f(x) = gt gives the moment generating 
function, and f(x) = e*t the characteristic funetion, which is nothing to 
do with the characteristic function of a set. 

Let x,y be independent variables on the line with continuous 
probability measures p,q respectively, and let X(t) be the measurable set 
of (x,y) with x+y < t, so x $ t-y. If P is the whole plane the probability 


measure of X(t), and so of x+y as t varies, is 

f XOLE) (xay) alpa) 

P 
where X(X(t);(x,y)) is the indicator of X(t). By Fubini's theorem, 
Theorem 17.2, and the continuity of p, this integral is equal to 


co, co) 


|e F hi P x(X(t) (x,y) ap Ja = | att-yide, 


the convolution of p and q. The way is now open to the Central Limit and 
allied theorems. Next we look at Karl Pearson's correlation ratto Nyx 
connected with the dependence of two variables x,y. The probability 
measure form is as follows., 

Let the probability measure associated with points z = (x,y) of the 
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plane P be obtained from p = | f(z)dA, A being the area function, to avoid 
l confusion with the mean. We write 


a(x) = | Foxy ddy > 0, Fe) = f yf(x,y)dy/g(x), 
y= | yf (z)dA, 
P 
(20.6) EA = [orn a] (y-y) F(z) dA. 


| (y-¥(x))7#(z)dA = | (y-7)*#da +2] (y-¥) (¥-y(x) fA f (5-5 (x))2#¢A 
P P P P 


u 
-_—— 
~ 
< 
<1 
~ 
rm 
=e 
a 
> 
1 
uu 7°. 


(20.7) ne = | (5-9 (x))2q(x)dx/ |, (y-7)7#(x,y)dA. 


yx 


Thus from (20.6), (20.7) we can prove the following respective 
results. 


Theorem 20.2: If ie" +1 then y = y(x) except in a set Z of (x,y) with 


V(psP3Z) = V(fA3;P3;Z) = V(A3P3Z n Zo) =0. 


where Zo ts the set where f #0. Ifn 
X of x with 


yx. = 0 then y(x) = Y except in a set 


vf gdx 3 (e400) 5X) = V(gAx; (=0,0);X) = vf Figg idan 


Í s 


(0,0) 3X) = 0. 


Thus, apart from sets of probability measure 0, there is in the first case 
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a complete functional dependence of y on x, while in the second case the 
mean of y for each x does not vary with x. In the very special case when 
both are true, so that y is almost everywhere equal to a constant b, then 


y = b also and 


f (-W?Flxy)dh = 0. 
P 

Finally, in T = R? let x = (Xp see0oXn)> let T(x) denote the infinite 
brick consisting of all w= (Wyses Wa) satisfying Wj < xj (3 = Tasia) 
and let p be a probability measure on R”. Then F(x) = p(T(x)) is called 
the distribution function of p. Let G(t,x) be a distribution function 
int= (ty..+.,t,) E R” with probability measure g(x,I) (I € R™), for 
p-almost all x, and for every t let G(t,x) be a p-measurable function of 


x. Then the function 
H(t) = | G(t,x)dF(x) 
T 
is a distribution function in RY, with probability measure q, say, and 
q(I) = | g(x3I)dF(x). 
T 


These follow from Fubini's theorem (Theorem 17.2). 
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CHAPTER 8 
HISTORY AND FURTHER DISCUSSION 


21 Other Integrals 


At the beginning of the book, Section 1 gives a brief introduction 
to the origins of integration and the integral of Cauchy, Riemann, and 
Darboux, followed in Section 2 by those of Stieltjes, J.C. Burkill, and 
Moore and Smith (refinement integral). Then Section 3 goes back to the 
calculus (Newton) indefinite integral, and forward to the modern 
generalized Riemann integral. This last includes the previously 
mentioned integrals. Those in Section 1, 2 that are not refinement 
integrals are defined by 6-fine divisions with § a positive constant. 
For refinement integrals see Theorem 4.6 Corollary. By (3.16) the 
calculus integral is included. In time, Riemann's integral was followed 
by Lebesgue's, and Theorem 3.4 shows that it is also included. The 
inclusion of the calculus and Lebesgue integrals make it not surprising 
that the first integral to include these, the special Denjoy integral, 
is itself included in the generalized Riemann integral. This follows 
since the Cauchy extension of integrals included in generalized Riemann 
integrals and over expanding intervals or bricks, is also included, 
Theorem 5.14, while the Harnack extension of generalized Riemann integrals 
is a generalized Riemann integral (Theorem 5.15). These two extensions 
were applied by Denjoy first to Lebesgue integrals and then to the 
integrals so found, in a transfinite inductive process. Thus at every 
step the integral is a generalized Riemann integral. Denjoy proved that 
in integrating derivatives the process terminates after an at most count- 
able number of steps. 

0. Perron (1880-1975) defined an integral in 1914 using derivates 
in one dimension that is easily shown to be equivalent to the variational 
integral of the interval-point function of the type f(x) Ax, which in turn 
is equivalent to the corresponding generalized Riemann integral (Theorem 
5.3 and following definitions). As H. Hake proved in 1921 that the special 
Denjoy integral is included in the Perron integral and as P. Alexandroff 
in 1924 and H. Looman independently in 1925 proved the reverse inclusion, 
we have a second proof of the equivalence of the Denjoy special integral 


191 


of f with the generalized Riemann integral of fAx. See Theorems 21.1, 
Delis 

In 1912, N. Lusin (41) gave a descriptive definition of the Denjoy 
integral of f over [a,b]. It is AF where F is ACG* and has f as derivative 
almost everywhere, and in recent years several have proved that Lusin's 
integral is equivalent to the generalized Riemann integral of f(x)Ax in 
one dimension, which give other proofs of equivalence. See Yoto Kubota 
(31), and Lee Peng-Yee and Wittaya Naak-in (36), for example. Here we 
define the Perron integral in the following way. For [a,b] cR, b-a 
finite, let f,M be functions from [a,b] to RI. Then the lower derivate 
M' of M is the lower limit of AM/Ax as the interval shrinks round a fixed 
point, with appropriate one-sidedness at aand b. M is defined a Perron 
major function of f in [a,b] if M is continuous on [a,b] with M(a) = 0, 

M' 2 f almost everywhere on [a,b], and M' > - » except for a countable 
set X on [a,b]. 

This definition is in a sense an amalgam of the worst features of the 
definitions of 0. Perron (51), H. Bauer (2), H. Hake (17), H. Looman (40), 
S. Saks (54), pp. 186-203, and E.J. McShane (43), p. 313, except that 
McShane used two functions like M', and that our f is always finite-valued. 
The last causes no trouble since if a function is Perron integrable it is 
finite almost everywhere, we can replace f by 0 in the exceptional set, 
and we can include it with the previous set of variation zero. 

The infimum of M(b) for all Perron major functions M of f on [a,b], is 
called the upper Perron integral of f on [a,b]. An m:[a,b] > R! is a 
Perron minor function of f on [a,b], if and only if -m is a Perron major 
function of -f on [a,b]. The lower Perron integral of f on [a,b] is the 
supremum of m(b) for all such m, and if the upper and lower Perron 
integrals are equal, their value is the Perron integral of f on [a,b]. 
Further, we say that M is a strong Perron major function of f on [a,b], 
if and only if M is continuous on [a,b] with M(a) = 0 and M' > f everywhere 
on [a,b], while m is a strong Perron minor function on f on [a,b] if and 
only if -m is a strong Perron major function of -f on [a,b]. 


Theorem 21.1: The upper and lower Perron integrals of f on [a,b] are 
unaltered if the Perron major and minor functions of f are restricted to 
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strong Perron major and strong Perron minor functions of f, respectively, 
and in consequence, if f is replaced by an f* : [a,b] + R! with f* = f 
almost everywhere. 


Proof: Using the continuity of a Perron major function M of f we remove 
the sequence (x3) of points where M' = - œ. Let 


L(x,H) = sup |M(x+h)-M(x)], L(x,0) = 0, L(x,-H) = -L(x,H). 
O<|h|<H 
Then for each fixed x, L(x,H) is bounded and continuous in H and monotone 
decreasing to 0 as H+ 0+. Thus, given e > 0, there are an Hy > 0 and 
L(x) such that 


(2.4) i < oO! te oe a 


Kl j j 
(21.2) L(x 54x) = (L(x.,H.) - L(x5.-H;) (x 2 Hs)» 
0 (x <s -H;) 


Then for 0 < hs Hy, using (21.1), (21.2), 


Lj(xj+h)-L; (x5) 2 [M(x;+h) - M(x) |> Lj(xj)-L;(x;=h) z [M(x 5)-M(x;-h]) 


0< Z L.<¢ ec, andM, =M+ FL 
=1 J 1 = 


is continuous in [a,b] with M > 0 at each Xj: As a countable set is of 
measure or variation zero, My is a Perron major function of f with X empty. 

Next we remove the exceptional set Y of measure or variation zero where 
W < f. For each j there is an open set G; > Y of measure or variation 
less than e.4 1. We define 
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29 (x € &.) as P(t) with P(a) = 0, and let m tend to P. Then M-P is monotone 
ERS 
I increasing and non-negative, and similarly so is P-m. Then from M'> f 
y. (x) = z.(x) = y.(x)dx,z(x) = p z.(x),z (b)<e274 and all intervals in the neighbourhood of a fixed end-point, we have 
j j j z a d odl : 
[a 5x) j=1 
0 tx ESG.) AM > fAx, AM-AP 2 fAx - AP, Am-AP < f(x)Ax - AP, 
ye? 


|fax - AP| < (AM-AP) + (AP-Am) = A(M-m), (D) Z|fAx-AP| <(D) ZA(M-m) 


Then Zj and z are continuous and monotone increasing in [a,b]. Putting = M(b) - m(b). 


Mo(x) ğ M(x) + alx) + e(x-a), M; ER Given e > 0, by choice of M,m, this difference is less than e. Hence P 
is the variational integral of f over [a,b]. Conversely, if P(t) is the 
variational integral of f over [a,t) (a < t < b), then using Theorem 5.3 
M, = -œ (x €Y), M 2zf+e>f (x € [a,bNY). (5.6) with h = fAx, H = AP, and e for 8e, and a suitable positive 
function ô, 
Thus the continuous My is a strong Perron major function of f on [a,b] 


and Theorem 21.1 follows from M(b) < M (b) < M(b) + e(2+b-a). fax < |fAx-AP| + AP <V(fAx-AP;Ax) + AP = AM, fAx 2 AP-V(fAx-AP3Ax)=Am, 
Theorem 21.2: If M,m are strong Perron major and strong minor functions, mos fs M', V(fAx-AP; [a,t)) 

respectively, of f on [a,b] then M-m ts monotone increasing. Further, if 

the Perron integral exists over [a,b] it exists, say as P(t), over [a,t], | being monotone increasing in t. Hence M and m are Perron major and minor 
fora <t <b, and if P(a) = 0 then P is the variational integral of f functions, respectively, of f over [a,b], and as the variation is as small 
over [a,b). Conversely, if the variational integral exists, so does the as we please, P is also the Perron integral of f. 

Perron integral, and the two are equal. This section shows that several integrals are included in the 


generalized Riemann integral in Euclidean space of a finite number of 


Proof: By Borel's covering theorem and using in [u,v] c [a,b], and Í dimensions. However, the methods used have a far greater scope than is 


m' = -(-m)', shown in this book. The original theory was based on a battery of axioms, 
but in preparation of the book, Henstock (27), J.J. McGrotty suggested 
m' < f<M', (M-m)' > 0, M(v)-m(v)-m(u)+m(u) > 0, the use of something to parallel the measure spaces of Lebesgue theory. 
So I introduced division spaces, see Chapter 10, and particularly the 
and M-m is strictly increasing for each such m,M, and the upper integral examples given on pp. 219 to 225. Since then, the number of examples of 
over [a,b] is not less than the lower integral. If the Perron integral integrals included in the division space theory has grown considerably, 
exists over [a,b] then M(b)-m(b) < e, for suitable m,M, given € > 0. As | covering practically all integrals that do not use convergence factors 
M(a) = 0 = m(a) with M-m strictly increasing, M(t)-m(t) < e (a < t < b), (smoothing devices) in their definition. Convergence-factor integrals 
and as e > 0 is arbitrary the Perron integral exists over [a,t]. Write it are included in the system of N-variational integrals. For division space 
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theory see, for example, R. Henstock (27), P. Muldowney (46), K.M. 
Ostaszewski (49), and references contained therein. 

For numerical work on integration the simple and dominated integrals 
were introduced in Haber and Shisha (14), (15), and Osgood and Shisha 
(47), (48), respectively. Lewis and Shisha (39) show that these are 
included in the integral of this book. 


22 Notes On The Previous Sections 


In this final section our task is to show the genesis and development 
of the various proofs and other details in the previous sections. Section 
1 and part of Section 2 contain in themselves the required history. 
However, Theorems 2.2, 2.4 seem new, while Theorem 2.5 comes from J.C. 
Burkill (6). Theorems 2.7, 2.8, 2.9 are developments of W.H. Young (58). 
Theorem 2.10 and corollary are a development of E. Helly (19). Theorems 
2.12, 2.13 are obvious and Lemma 2.14 is well known. Theorems 2.15, 2.16 
are due to G.H. Hardy (18), p. 187, and correspond to the Abel and 
Dirichlet tests of convergence of series, and similarly for Theorem 2.18. 
Theorem 2.19 is built on the well known integral test, while for more than 
Theorem 2.20 one may look at A.M. Ostrowski (50). 

In Section 3 we begin with the ordinary calculus and progress to the 
generalized Riemann integral that uses (3.17). After many attempts to 
give a Riemann definition to Lebesgue integration, see H. Lebesgue (34), 
pp. 30-33, E. Borel (3), (4), H. Hahn (16), A. Denjoy (11), (12), B. Levi 
(38), the first systematic efforts seem to be in J. Kurzweil (32), and in 
R. Henstock (22), pp. 277-278, (23), and (24). On 3rd October, 1963, 

K. Kartak informed me of J. Kurzweil's paper, so that until then, Kurzweil 
and myself were working independently. Theorem 3.1 was first given in 

P. Cousin (8), p. 22, for two dimensions. Lusin (42) used the idea for 
trigonometrical series, while W.H. and G.C. Young (59) state the result. 

The first incomplete proofs seem to be in J. Kurzweil (32), p. 423, Lemma 
1.1.1, and R. Henstock (22). Theorem 3.4 is taken from R.O. Davies and 

Z. Schuss (10) while Theorem 3.5 is from R. Henstock (24) pp. 34-35, Theorem 
22.2. Theorem 4.1 is the n-dimensional case of Theorem 3.1, and some 
further theorems are obvious. Note that complex-valued functions are 
integrated by the same definition as real-valued functions, and Theorem 4.5 
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provides a link. The proof of Theorem 4.6 (4.7) can be improved by 
proving (4.8) first, while for Theorem 4.7 see R. Henstock (24), p. 34, 
Theorem 22.1, for Exx. 4.5, 4.6, 4.8, 4.9, see the same book, p. 29, 
Exx. 20.1, 20.2, and for Theorem 5.1 see the book, p. 31, Theorem 21.1. 
Some call the inequality (5.5), Henstock's lemma. But I copied a proof 
of S. Saks (53) for Burkill integration, see R. Henstock (20), (21). My 
contribution was to give the proof in generalized Riemann integration 
and to use it in proofs of limit theorems. Thus a better name is the 
"Saks-Henstock lemma". The neat proof of Theorem 5.1 comes from R. 
Henstock (29), p. 401, Theorem 1. For Theorem 5.3 see the book, p. 40, 
Theorem 24.1. Theorem 5.4 coalesces several similar results, e.g. the 
book, p. 57 Theorem 30.1. For Theorem 5.6 see p. 49, Theorem 28.1. 

For Theorem 5.11 see the book, pp. 54-55, Theorem 28.6. For integration 
by parts, Theorem 5.12, see the book, p. 69, Theorem 33.1. The Cauchy 
extension, Theorem 5.14, is in the book, pp. 113-115, Section 46, and the 
extension to integration over infinite intervals is in the book, pp. 
115-118, Section 47. The Harnack extension, Theorem, 5.15, is called 
the Denjoy extension in the book, pp. 118-120, Section 48. Section 5 

is an important fundamental section, so that its origins need careful 
documentation. 

Section 6 originated from the book, pp. 43-46, Section 25, improved 
by R. Henstock (29). Section 7 on the variation set originated from the 
book, p. 32-33, but has developed considerably since then. 

Section 8, the first on limits under the integral sign, began in the 
book, pp. 82-85, Section 36, while Section 9 began with R. Henstock (28), 
p. 528, Theorem 18, blended with the book, pp. 85-87, Section 37. Section 
10 on controlled convergence is taken from P.Y. Lee and T.S. Chew (35), 
Section 11 is completely new, and Section 12 has much in common with the 
book, pp. 127-140, Chapter 9, E. Hilder (30), H. Minkowski (44), and 
W.H. Young (58). Section 13 translates many results into results for more 
general convergence, while Section 14 looks at Fatou's lemma, following 
R. Henstock (29). Section 15 applies the previous Section 13 to 
differation of integrals, with W. Sierpinski's (55) and Vitali's (57) 
covering theorems, and Section 16 updates the book, pp. 92-98 Section 39. 
There have been many improvements in the proof of Fubini's Theorem 17.2 


197 


since the proof in R. Henstock (23). Theorem 17.3 is due to T.W. Lee REFERENCES 


(37). Sections 18, 19 are new, while Section 20 is a condensation of the (1) Alexandroff, P., ‘Uber die Aquivalenz des Perronschen und des 
book, pp. 148-161. Denjoyschen Integralbegriffes', Math. Zeitschrift 20, 213-222 
(1924). 


(2) Bauer, H., ‘Der Perronsche Integralbegriff und seine Beziehung zum 
Lebegueschen,' Monatsh. Math. Phys. 26, 153-198 (1915). 
| (3) Borel,E., 'Sur la définition de l'intégrale définie', Comptes 

Rendus 150, 375-378 (1910). 

(4) Borel, E., 'Sur une condition générale de l'intégrabilité', Comptes 
Rendus 150, 508-510, (1910). 

(5) Burkill, J.C., 'Functions of intervals', Proceedings London Math. 
Soc. (2) 22, 275-310; (1924). 

(6) Burkill, J.C., 'Differential properties of the Young-Stieltjes 
integrals', Journal London Math. Soc. 23, 22-28 (1948), Math. Reviews 
10-185. 

(7) Cauchy, A.L., Cours d'analyse de 1'École Royale Polytechnique, 
partie; analyse algebrique (1821, Paris), Works (2) 3. 

(8) Cousin, P., 'Sur les fonctions de n variables complexes', Acta 
Math. 19, 1-.... (1894). 

(9) Darboux, J.G. 'Memoire sur les fonctions discontinues', Ann. Sci. 
Ec. Norm. Sup. (2) 4.57-112, (1875). 

(10) Davies, R.O., and Schuss, Z., 'A proof that Henstock's integral 
includes Lebesgue's', Journal London Math. Soc. (2) 2, 561-562 
(1970), Math. Reviews 42 #435. 

(11) Denjoy, A., ‘Sur intégration riemanienne', Comptes Rendus Acad. 
Sci. Paris 169, 219-220 (1919). 

(12) Denjoy, A., ‘Sur la définition riemannienne de l'intégrale de 
Lebesgue', Comptes Rendus Acad. Sci. Paris 193, 695-698 (1931), 
Zent. für Math. 3.106. 

(13) Dollard, J.D., and Friedman, C.N., Product Integration (Addison- 
Wesley, 1979) Math. Reviews 81e:34003. 

(14) Haber, S., and Shisha, 0., ‘An integral related to numerical] 
integration’, Bulletin American Math. Soc. 79 930-932 (1973), Math. 
Reviews 47 #7288. 


are 


198 199 


Haber, S., and Shisha, 0., ‘Improper integrals, simple integrals, 
and numerical quadrature’, Journal Approximation Theory 11, 1-15 
(1974), Math. Reviews 50 #5309. 

Hahn, H., ‘Uber annaherung der Lebesgueschen integrale durch 
Riemannsche summen', Sitzber, Akad. Wiss Wien Abt. 123, 713-743 
(1914). 

Hake, H., ‘Ueber de la Vallee Poussins Ober- und Unterfunktionen 
einfacher Integrale und die Integraldefinition von Perron', Math. 
Annalen 83, 119-142 (1921) 

Hardy, G.H., ‘Notes on some points in the integral calculus (I) 

On the formula for integration by parts', Messenger of Math. 30, 
185-187 (1901). 

Helly, E., Uber lineare Funktionaloperationen', Sitzungsberichte 
der Naturwissenschaftlichen Klasse der Kaiserlichen Akademie der 
Wissenschaften 121, 265-297 (1921). 

Henstock, R., 'On interval functions and their integrals', Journal 
London Math. Soc. 21, 204-209 (1946), Math. Reviews 8, 572. 
Henstock, R., 'On interval functions and their integrals (II 
Journal London Math. Soc. 23, 118-128 (1948), Math. Reviews 1 
239. 

Henstock, R., 'The efficiency of convergence factors for functions 


ar 
10, 

of a continuous real variable', Journal London Math. Soc. 30, 273- 
286 (1955), Math. Reviews 17, 359. 

Henstock, R., 'Definitions of Riemann type of the variational 
integrals', Proceedings London Math. Soc. (3) 11, 402-418 (1961), 
Math. Reviews 24 #A1994. 

Henstock, R., Theory of Integration (Butterworths, London, 1963), 
Math. Reviews 28 #1274. 


Henstock, R., 'The integrability of functions of interval functions’, 
_ Journal London Math. Soc. 39, 589-597 (1964), Math. Reviews 29, 


#5975. 

Henstock, R., ‘A Riemann-type integral of Lebesgue power', Canadian 
Journal of Math. 20, 79-87 (1968), Math. Reviews 36 #2754. 

Henstock, R., Linear Analysis (Butterworths, London, 1968), Math. 
Reviews 54 #7725. 


200 


(32) 


Henstock, R., ‘Generalized integrals of vector-valued functions’, 
Proceedings London Math. Soc. (3) 19, 509-536 (1969), Math, Review 
40 #4420. 

Henstock, R., ‘Generalized Riemann integration and an intrinsic 
topology’, Canadian Journal of Maths. 32, 395-413 (1980), Math. 
Reviews 82b: 26010. 

Hélder, E., ‘Uber einen Mittelwertsatz', Nachr. Akad. Wiss. 
Göttingen (Math. Phys.) 38-47 (1889). 

Kubota, Y., 'A direct proof that the RC-integral is equivalent to 
the D*-integral', Proceedings American Math. Soc. 80, 293-296 (1980), 
Math. Reviews 81h:26006. 

Kurzweil, J., 'Generalized ordinary differential equations and 
continuous dependence on a parameter', Czechoslovak Math. Journal 
7 (82) 418-446 (1957) (especially 422-428), Math. Reviews 22 #2735. 
Lebesgue, H., ' Intégrale, longueur, aire,' Annali Mat. Pura Appl. 
(3) 7.231-359 (1902), Jbuch 33, 307. 

Lebesgue, H., ‘Sur les intégrales singuli@res,' Annis. Fac. Sci: 
Univ. Toulouse (3) 1, 25-117 (1909). 

Lee, P.Y. and Chew, T.S., 'A better convergence theorem for Henstock 
integrals’, Bulletin London Math. Soc. 17 557-564 (1985), Math. 
Reviews 87b:26010. 

Lee, P.Y., and Wittaya, N., 'A direct proof that Henstock and 
Denjoy integrals are equivalent', Bulletin Malaysian Math. Soc. (2) 
5, 43-47 (1982), Math. Reviews 84f:26011. 

Lee, T.W., 'On an extension of Fubini's theorem', Journal London 
Math. Soc. (2) 4, 519-522 (1972), Math. Reviews 46 #315. 

Levi, B., 'Teoria de la Integral de Lebesgue independiente de la 
nocion de Medida', Publ. Inst. Mat. Univ. Nac. Litoral 3, 65-116 
(1941), Math. Reviews 3, 227. 

Lewis, J.T., and Shisha, 0., ‘The generalized Riemann, simple, 
dominated and improper integrals', Journal of Approximation Theory 
38, 192-199 (1983), Math. Reviews 84h:26014. 

Looman, H., ‘Ueber die Perronsche Integraldefinition', Math. 
Annalen 93, 153-156 (1925). 


201 


Lusin, N., ‘Sur les propriétés de l'intégrale de M. Denjoy', Comptes (57) Vitali, G., ‘Sui gruppi di punti e sulle funzioni di variabili 


Rendus Acad. Sci. Paris 155, 1475-1478 (1912). reali’, Atti Accad. Sci. Torino 43, 75-92 (1908). 

Lusin, N., Integrals and trigonometric series (in Russian), Moscow, (58) Young, W.H., ‘On classes of summable functions and their Fourier 
1915. series', Proc. Roy. Soc. series A.87, 225-229 (1912). 

McShane, E.J., Integration (Princeton University Press, 1944), Math. (59) Young, W.H. and Young, G.C., 'On the reduction of sets of intervals’, 
Reviews 6, 43. Proceedings London Math. Soc. (2) 14, 111-130 (1915). 

Minkowski, H., Geometrie der Zahlen, I, 115-117 (1896). 

Moore, E.H., ‘Definition of limit in general integral analysis’, Cameron, R.H. and Martin, W.T., 'Mm unsymmetrical Fubini theorem', 

Proc. Natn. Acad. Sci. U.S.A., 1, 628-632 (1915). Bulletin American Math. Soc. 47, 121-125 (1941), Math. ‘Reviews 2; 257. 
Muldowney, P., A general theory of integration in function spaces, Denjoy, A., ‘Une extension de l'intégrale de M. Lebesgue’, C.R. Acad. 
Pitman Research Notes in Maths. 153 (Longmans, 1987). Sci. Paris 154, 854-862 (1912). 

Osgood, C.F., and Shisha, 0., ‘The dominated integral', Journal of Denjoy, A., 'Calcul de la primitive de la fonction dérivée la plus 
Approximation Theory 17, 150-165 (1976), Math. Reviews 54 #6467. générale', C.R. Acad. Sci. Paris, Ser. A.B, 154, 1075-1078 (1912) 

Osgood, C.F., and Shisha, 0., ‘Numerical quadrature of improper Harnack, A., 'Die allgemeinen Sätze Uber den zusammenhang der Funktionen 
integrals and the dominated integral’, Journal of Approximation einer reelen Variabeln mit ihren Ableitungen, II, Math. Annalen 24, 217- 
Theory 20, 139-152 (1977), Math. Reviews 56, #7128. 252 (1884). 

Ostaszewski, K.M., 'Henstock integration in the plane’, Memoirs McShane, E.J., Unified Integration (Academic Press, 1983), Math. Reviews 
American Math. Soc. 63 no. 353 (1986), Math. Reviews 87j:26016. 36c: 28002 

Ostrowski, A.M., ‘On Cauchy-Frullani integrals', Comment. Math. Neyman , J., and Pearson, E.S., 'On the problem of the most efficient 
Helvetici 51, 57-91 (1976), Math. Reviews 53 #8347. tests of statistical hypotheses’, Phil. Trans. A231, 289-337 (1933), 
Perron, 0., ‘Ueber den Integralbegriff', Sitzber. -B.Heidelberg Zent. für math. 6, 268 

Akad. Wiss., Abt. A 16, 1-16 (1914). Robbins, H., ‘Mixture of two distributions’, Annals of Math. Statistics 
Riemann, G.F.B., ‘Uber die Darstellbarkeit einer Function durch eine 19, 360-369 (1948), Math. Reviews 10, 103 

trigonometrische Reihe', Abh. Gesell. Wiss. Göttingen 13, math. kl. de la Vallée Poussin, C., ‘Etude des intégrales a limites infinies pour 
87-132 (1868), Oeuvres mathematiques de Riemann (Ed. L. Laugel) lesquelles la fonction sous le signe est continue', Annales de la Société 
(1898, Paris: reprinted 1968, Paris and Cleveland). Math. Reviews Scientifique de Bruxelles, 16, 150-180 (1892) 

36 #4952. _ 


Saks, S., ‘Sur les fonctions d'intervalle', Fundamenta Math. 10 
211-224 (p. 214) (1927), dbuch 53, 233. 

Saks, S., Theory of the Integral, 2nd. English edition, Warsaw, 
1937, Zent. fur Math. 17, 300. 

Sierpinski, W., ‘Un lemme métrique', Fundamenta Math. 4, 201-203 
(1923). 

Stieltjes, T.J., ‘Recherches sur les fractions continues’, Annales 
de la Faculte des Sciences de Toulouse 8, 1-122 (1894). 


202 203 


Abel, N.H., 20-22, 25 

absolute continuity 62, 93 

absolute integral 42, 93 

almost everywhere 55 

Archimedes 1 

Arzéla, C. 178 

Barnard, G.A. 183 

bounded variation 5, 55 

brick 41 

Burkill, J.C. 5, 18, 39, 162 

Cantor diagonal process 48 

Cantor's ternary set 25, 74 

Carathéodory, C. 156, 157, 176, 
179 

Cauchy, A.L. 3, 19 

Cauchy limit 19, 20, 22, 67, 
68, 108 

Cauchy sequence 9 

Gesaro-Perron integral 39 

characteristic function 188 

classification 180 

compatible 32 

complete space 109 

convolution 188 

correlation ratio 188 

countably additive 159 

Darboux, J.G. 1, 30 

6-fine 32, 33, 41 

§ shrinks 32 

density theorem 148 

Descartes, R. 1 

Dirichlet, J.P.G.L. 20-22, 25 

distribution function 190 


INDEX 


division 41 
Dollard, J.D. 5 

error of first kind 186 

error of second kind 186 

Eudoxus 1 

Fatou, P. 110, 120, 121, 129, 131 
finitely additive 51 

finitely subadditive 51 

finitely superadditive 51 


fonction des sauts 14 

Friedman, C.N., 5 

Frullani 23 

Fubini, G. 163, 190 

fundamental(E) 44, 52 

fundamental sequence 9 

generalized absolute continuity 
(ACG*) 62 

generalized bounded variation 
(VBG*) 57 

generalized Riemann integral 29} 38.5 
35, 36, 38-40, 42, 48, 72. 

generating function 188 

Hardy, G.H. 21, 22 

Harnack, A. 67, 69 

Henstock, R. 5 

Hölder's inequality 79, 111-113, 117, 
121 

Huygens, C. 1 

independence 185 

integral test 23 

integration by parts 64 

integration by substitution 59 

integration over infinite bricks 69 


204 


inverse function 12 

Jordan, C. 133 

Jordan decomposition 85 

jump function 14 

Kollektiv 182 

Lebesgue, H. and integral 3, 
35-38, 58, 63, 67, 72, 148, 160, 
162, 178 

Lebesgue-Stieltjes integral 7 

Leibnitz, G.W. 1 

level of significance 184 

Levy, B. 90 

limiting variation set 83 

lower Darboux sum 2 

McGrotty, J.J. 147 

McShane, E.J. 32 

mean 188 

mean convergence 109, 121 

measurable 154-157 

mesh 1 

Minkowski's inequality 112, 114 

von Mises 182 

moduli of continuity 8 

moment 188 

moment generating function 188 

Moore, E.H. 17 

Muldowney, P.J. 89 

multiplicative axiom 129 

Newton, I. and integral, 1, 29-31, 
35, 38 

non-absolute integral 42, 57 

null function 35, 36 

outer measure 58 

partial division 83 

partial set 51 


205 


partial sum 83 

partition 41 

Pearson, K. 188 

Perron, 0., and integral 39, 67 

probability 182 

proper partial set 51 

Radon, J. 7 

rational belief 184 

refinement integral 17-19, 39, 47 

refinement of a partition 17 

regular measure 63 

Riemann, G.F.B., and integral 1-3, 
5 18 19:5 24, 22, 30, 315-35; 48 

Riemann-complete integral 29 

Riemann-Darboux integral 2-4 

Riemann-Darboux-Stieltjes integral 5 

Riemann product integral 5 

Riemann-Stieltjes integral 5-7, 
17-19, 21, 39, 153 

Riemann sum 32 

Sierpinski, W. 

o-integral 17 

significant 185 

special Denjoy integral 39, 67, 72 

specification 184 


136, 140 


standard deviation 188 

step function 149 

Stieltjes, T.J. 3 

Tonelli, L. 169 

upper Darboux sum 2 

de la Vallée Poussin, C. 133 
variance 188 

variation 5, 54, 58 
variation set 83 

variation zero 55 


variational integral 54 
vertex 41 

Vitali, G. 91, 101, 136, 143 
volume 42 

Young, W.H. 12, 25, 115, 162 
Young's inequality 14, 111 
Zorn's lemma 129 


206 


NOILVYSDALNI JO AYOJHL JHL NO SAHYNLIAT 


LECTURES ON 
THE THEORY OF 
INTEGRATION 


Ralph Henstock 


World Scientific 


